dplyr Summarize

Introduction

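The summarize() function collapses a data frame into summary statistics. Each expression you pass reduces a column to a single value, so the result has one row for ungrouped data, or one row per group when the data is grouped.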

Basic Summaries

library(dplyr)

df <- tibble(
  category = c("A", "B", "A", "B", "A"),
  value = c(10, 20, 30, 40, 50)
)

# Single summary
summarize(df, total = sum(value))

# Multiple summaries
summarize(df,
          total = sum(value),
          mean = mean(value),
          count = n())

Common Functions

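# Each function reduces the value column to a single number
summarize(df,
          sum = sum(value),
          mean = mean(value),
          median = median(value),
          sd = sd(value),        # sample standard deviation
          min = min(value),
          max = max(value),
          n = n())               # number of rows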
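
The functions above return NA when the column contains a missing value. The sketch below shows one way to handle that with na.rm = TRUE. The df_na data and the output column names are made up for illustration and are not part of the lesson's data.

```r
library(dplyr)

# Hypothetical data with one missing value (illustration only)
df_na <- tibble(value = c(10, NA, 30))

na_summary <- summarize(df_na,
                        mean_with_na = mean(value),               # NA: the missing value propagates
                        mean_no_na   = mean(value, na.rm = TRUE), # 20: computed from 10 and 30 only
                        n_rows       = n(),                       # 3: n() counts every row
                        n_present    = sum(!is.na(value)))        # 2: rows with a non-missing value

na_summary
```

Note that n() counts rows regardless of missing values, so a separate count such as sum(!is.na(value)) is needed when the number of usable observations matters.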

Grouped Summaries

df %>%
  group_by(category) %>%
  summarize(
    total = sum(value),
    mean = mean(value),
    count = n()
  )
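
As a rough check of what the grouped call returns, the sketch below rebuilds the same df and stores the result in by_group (a name introduced here for illustration). It also shows the .by argument as an alternative to group_by(), assuming dplyr 1.1.0 or later is installed.

```r
library(dplyr)

# Same data as in the Basic Summaries section
df <- tibble(
  category = c("A", "B", "A", "B", "A"),
  value = c(10, 20, 30, 40, 50)
)

# One row per category
by_group <- df %>%
  group_by(category) %>%
  summarize(
    total = sum(value),   # A: 10 + 30 + 50 = 90, B: 20 + 40 = 60
    mean = mean(value),   # A: 30, B: 30
    count = n()           # A: 3, B: 2
  )

by_group

# Per-call grouping without group_by(); assumes dplyr >= 1.1.0
by_arg <- summarize(df, total = sum(value), .by = category)

by_arg
```

group_by() followed by summarize() sorts the output by the grouping column, while .by keeps groups in order of first appearance. With this data both give A before B.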

Summary

summarize() reduces a data frame to aggregated statistics such as sums, means, and counts. Combine it with group_by() to get one row of summaries per group.
