Graphs are visual representations of data that help in understanding relationships, trends, and distributions.
| Purpose | Best Graph |
|---|---|
| Compare categories | Bar chart, Pie Chart |
| Show distribution | Histogram, Box plot, Violin plot |
| Show trends over time | Line graph, Area chart |
| Show relationships/correlations | Scatter plot, Bubble chart |
| Visualize patterns in data | Heatmap |
Purpose: Compare quantities of categorical data.
When to Use:
Unordered Bar Chart: Use for simple category visualization.
Ordered Bar Chart: Highlights the most/least common categories.
Horizontal Bar Chart: Improves readability for long category names.
Grouped Bar Chart: Compares subcategories across main categories.
Stacked Bar Chart: Shows composition of each category.
Purpose: Display the distribution of numerical data.
When to Use:
Bins: The range of values divided into intervals. The width of these intervals determines the bin size.
Axis:
Basic Histogram: General overview of distribution.
Custom Bin Width: Fine-tuning for specific data patterns.
Density Overlaid: Assess distribution shape visually.
Grouped Histogram: Compare distributions across subgroups.
Faceted Histogram: Side-by-side comparison of subgroups.
Cumulative Histogram: Understand cumulative trends in the data.
Purpose: Show trends over time or sequences.
When to Use:
Basic Line Graph: Single line showing trends for one variable.
Multiple Line Graphs: Comparing trends of multiple variables.
Smoothed Line Graph: Adds a smooth curve to depict trends better.
Line Graph with Points: Shows data points explicitly.
Faceted Line Graph: Separate panels for different groups or variables.
Step Line Graph: Shows data changes stepwise.
Purpose: Show relationships or correlations between two numerical variables.
Scatter plots are commonly used to:
X-axis: Represents one variable (independent or predictor).
Y-axis: Represents another variable (dependent or outcome).
Points: Each point represents one observation.
Purpose: Represent proportions of a whole.
When to Use:
Limitations:
Basic Pie Chart Represent proportions of categories in a single dataset.
With Percentages Display proportions explicitly for easier interpretation.
Donut Chart Aesthetic variation of a pie chart, often used in dashboards.
Faceted Pie Chart Compare distributions across multiple groups.
Exploded Pie Chart Highlight a specific slice for emphasis.
## Box Plot
Purpose: Summarize the distribution of data through quartiles.
When to Use:
Minimum: The smallest data point excluding outliers.
First Quartile (Q1): The median of the lower half of the dataset.
Median (Q2): The middle value of the dataset.
Third Quartile (Q3): The median of the upper half of the dataset.
Maximum: The largest data point excluding outliers.

Basic Box Plot: Summarizing distribution of a single variable.
Box Plot by Category: Comparing distributions across categories.
With Outliers Highlighted: Identifying potential outliers.
Horizontal Box Plot: Improving readability for long category names.
Grouped Box Plot: Comparing multiple groups within categories.
Violin and Box Plot Combination: Adding detailed distribution insights to a box plot.
Customized Box Plot: Enhancing visual appeal for reports or presentations.
Purpose: Represent data in a matrix format using color coding.
When to Use:
Basic Heat Map Visualizing relationships between categories and values.
Heat Map with Labels Adding specific values for easier interpretation.
Correlation Heat Map Visualizing pairwise correlations between variables.
Time-Series Heat Map Tracking changes over time for multiple categories.
Diverging Heat Map Highlighting deviations from a baseline or midpoint.
