How to read a histogram, min, max, median & mean (2024)

How to read a histogram, min, max, median & mean (1)

Datawrapper offers powerful tools to understand your numeric data. If you are in step 2: Describe, you canclick on the header of any column with numbers to display a histogram of values, including their minimum, maximum, median, mean, and the number of potentially invalid values. Here's a quick explanation of what all of those words mean. Scroll down to get more in-depth explanations with examples.

  • Value distribution or histogram: Shows how the values in your column are distributed. The higher the bar, the more values fall in that range.
  • Min and max: Shows you the lowest (minimum) and highest (maximum) values in your column.
  • Mean: Also called the average. The sum ofall the values in your column divided by the total number of values.
  • Median: The number that would be in the middle of an ordered list of your values. Unlike the mean, the median isn't affected by outlier values.

How to read a histogram, min, max, median & mean (2)

Let's introduce some sample data. We have 101 rows in each column. The column "Berries" is filled with numbers from 0 to 100. The column "Lemons" is filled with numbers from 0 to 95 and with five "5000"s. The column "Apples" is filled with random data between 0 and 100.

Berries
Lemons Apples
0 0 58.32864768
1 1 48.45585765
2 2 94.03562691
3 ... 12.33567972
4 94 62.16973429
5 95 33.46912268
... 5000 7.506870755
97 5000 45.94068342
98 5000 29.94451574
99 5000 88.70147353
100 5000 51.67053301

Value distribution (histogram)

The histogram is a chart that tells us how the values in the selected column are distributed. This is great for understanding which values occur more or less often: Which salaries are most common, which survey replies were chosen the least, or which range of unemployment rates most counties have to deal with.

Here's how it works: A histogram automatically creates equally-sized bins between our lowest and highest values and tells us how many values fall within each bin. The higher the bar, the more values fall in that bin.

That's the histogram of our random data (Apples):

How to read a histogram, min, max, median & mean (3)

We can see that most of the values in this column fall between 80 and 90.

Attention: The values 80 and 89.99999 counts as part of this range between 80 and 90. But the value 90 counts as part of the last range; the one between 90 and 100. This is true for all values except the highest value, 100. To not create a new range just for this one value, 100 counts as part of the 90 to 100 range. We can see this better when we ask Datawrapper to show us the histogram for Berries:

How to read a histogram, min, max, median & mean (4)

Berries contains 101 values. The values 0 to 9.999 are in the first range; the values 10 to 19.999 are in the second range and the values 90 to 99.9999 plus the value 100 are in the last range. You can hover over the individual bars to check how many values fall in each range.

To understand that a histogram creates equally-sized ranges between the lowest and the highest value, let's look at the Lemons column. Because of its five outlier values of 5000, all the smaller values between 0 and 100 fall in a single 0-500 range:

How to read a histogram, min, max, median & mean (5)

Min and Max

Themin and max tell you your column's lowest and highest values. This is pretty straightforward: The min for Berries and Lemons is zero; the max for Berries is 100, and the max for Lemons is 5000.

Mean and Median

Themean is the average that most of us are familiar with. It gets calculated like this: First, we sum up all our values. In the case of Berries, that's 0+1+2+3+4+5+...+99+100 =5050. Then, we divide them by the numberof values. We have 101 values in our Berries column, and 5050 divided by 101 is 50. That's ourmean: 50.

Calculating themedianis even simpler: We sort all the values in our head from low to high (1, 2, 3, 4, 5, ..., 99, 100). Then, we check the value in the center of that list. This value is our median. In the case of Berries, that's 50: The column has the same number of values lower than 50 and higher than 50.

For our Berries column, the mean and the median are the same number: 50. That's because the values in this column are very evenly distributed. Let's look at the Lemons column instead: How to read a histogram, min, max, median & mean (6)

The mean of Lemons is 293(!), but the median isstill 50. That's because, when calculating the median, it doesn't matter if the higher values are very close to it or thousands of numbers away. The median just sorts all of the values and checks which one is in the middle. That's why the median is a useful measure: it ignoresoutliers.

This can be useful in many cases. For example, let's imagine a company with 101 employees, each one earning between $0 and $95. The company also hasfive bosses, who earn $5000 each. It wouldn't be fair to say that every person in this company makes about $300 (the mean salary). It would be closer to the truth to say that everyone earns about $50 (the median salary). When it comes to salaries, the median literally tells us what the "person in the middle" makes.

Pro tip: You can hover over the values for min, max, mean, and median to display them in the histogram: How to read a histogram, min, max, median & mean (7)

Invalid values

"Invalid values" is a number we only show you when you have invalid values. Most often, these are letters or words hidden in your column of numbers. Datawrapper tells you the absolute number of invalid values. It also shows you what percentage of the whole column is made up of invalid values. This percentage can help you decide if your data is usable or not.

Here's the Berries column with six invalid values:

How to read a histogram, min, max, median & mean (8)

We hope that this tutorial and the information we show with the histogram helped you understand your data better. If you still have questions, don't hesitate to go in touch with us at support@datawrapper.de.

How to read a histogram, min, max, median & mean (2024)

FAQs

How do you interpret the mean and median of a histogram? ›

Here are some tips for connecting the shape of a histogram with the mean and median:
  1. If the histogram is skewed right, the mean is greater than the median. ...
  2. If the histogram is close to symmetric, then the mean and median are close to each other. ...
  3. If the histogram is skewed left, the mean is less than the median.
Jul 12, 2021

How do you read a histogram value? ›

Here's how it works: A histogram automatically creates equally-sized bins between our lowest and highest values and tells us how many values fall within each bin. The higher the bar, the more values fall in that bin.

How to interpret the spread of a histogram? ›

Peaks and spread

The peaks represent the most common values. Assess the spread of your sample to understand how much your data varies. For example, in this histogram of customer wait times, the peak of the data occurs at about 6 minutes. The data spread is from about 2 minutes to 12 minutes.

How do you describe the data on a histogram? ›

A histogram shows how frequently a value falls into a particular bin. The height of each bar represents the number of values in the data set that fall within a particular bin. When the y-axis is labeled as "count" or "number", the numbers along the y-axis tend to be discrete positive integers.

How do you describe the distribution of a histogram? ›

A histogram with normal distribution is symmetrical. In other words, the same amount of data falls on both sides of the mean. A normal distribution will have a skewness of 0. The direction of skewness is “to the tail.” The larger the number, the longer the tail.

How do you interpret mean and median? ›

The mean (average) of a data set is found by adding all numbers in the data set and then dividing by the number of values in the set. The median is the middle value when a data set is ordered from least to greatest. The mode is the number that occurs most often in a data set. Created by Sal Khan.

What is the best way to describe a histogram? ›

A histogram is a chart that plots the distribution of a numeric variable's values as a series of bars. Each bar typically covers a range of numeric values called a bin or class; a bar's height indicates the frequency of data points with a value within the corresponding bin.

How do you answer a histogram? ›

A histogram is drawn like a bar chart, but often has bars of unequal width. It is the area of the bar that tells us the frequency in a histogram, not its height. Instead of plotting frequency on the y-axis, we plot the frequency density. To calculate this, you divide the frequency of a group by the width of it.

What is the data analyzed to identify in the histogram? ›

Histograms provide a great way to evaluate data. They can be used to check data for extreme values, or outliers, and to help understand the distribution of your data. The distribution of a variable is important to understand when selecting appropriate statistical analysis tools.

How is data represented in a histogram? ›

The histogram is represented by a set of rectangles, adjacent to each other, where each bar represent a kind of data. Statistics is a stream of mathematics that is applied in various fields.

What information is being given by the histogram? ›

A histogram[1] is used to summarize discrete or continuous data. In other words, it provides a visual interpretation of numerical data by showing the number of data points that fall within a specified range of values (called “bins”). It is similar to a vertical bar graph.

What is the relationship of the mean to the median and the mode in a histogram that is strongly skewed to the left? ›

To summarize, generally if the distribution of data is skewed to the left, the mean is less than the median, which is often less than the mode.

Is the mean greater than the median if a histogram is skewed left? ›

The mean is affected by outliers that do not influence the median. Therefore, when the distribution of data is skewed to the left, the mean is often less than the median. When the distribution is skewed to the right, the mean is often greater than the median.

How do you analyze data using mean median and mode? ›

To find the mean, add up the values in the data set and then divide by the number of values that you added. To find the median, list the values of the data set in numerical order and identify which value appears in the middle of the list. To find the mode, identify which value in the data set occurs most often.

What is the mean and median of a positively skewed histogram? ›

The mean of positively skewed data will be greater than the median. In a left-skewed distribution, the mean of negatively skewed data will be less than the median. A right-skewed or positive distribution means its tail is more pronounced on the right side than on the left.

Top Articles
5 Common Quantum Computing Questions from Enterprise Customers, Answered
What are some countries doing to prevent climate change?
7 C's of Communication | The Effective Communication Checklist
Satyaprem Ki Katha review: Kartik Aaryan, Kiara Advani shine in this pure love story on a sensitive subject
Lifewitceee
Boomerang Media Group: Quality Media Solutions
Unitedhealthcare Hwp
Byrn Funeral Home Mayfield Kentucky Obituaries
Gabrielle Abbate Obituary
Craigslist Nj North Cars By Owner
The Best English Movie Theaters In Germany [Ultimate Guide]
Does Pappadeaux Pay Weekly
Oscar Nominated Brings Winning Profile to the Kentucky Turf Cup
Craigslist Farm And Garden Cincinnati Ohio
Best Forensic Pathology Careers + Salary Outlook | HealthGrad
Uktulut Pier Ritual Site
[Cheryll Glotfelty, Harold Fromm] The Ecocriticism(z-lib.org)
Closest Bj Near Me
Isaidup
Ezel Detailing
Marion City Wide Garage Sale 2023
Troy Gamefarm Prices
Timeline of the September 11 Attacks
3569 Vineyard Ave NE, Grand Rapids, MI 49525 - MLS 24048144 - Coldwell Banker
Abga Gestation Calculator
Christmas Days Away
Grays Anatomy Wiki
Www.craigslist.com Syracuse Ny
How to Play the G Chord on Guitar: A Comprehensive Guide - Breakthrough Guitar | Online Guitar Lessons
Planet Fitness Lebanon Nh
About :: Town Of Saugerties
Wal-Mart 2516 Directory
Cox Outage in Bentonville, Arkansas
One Main Branch Locator
Check From Po Box 1111 Charlotte Nc 28201
Wrigley Rooftops Promo Code
Gateway Bible Passage Lookup
Craigslist Freeport Illinois
Henry Ford’s Greatest Achievements and Inventions - World History Edu
Letter of Credit: What It Is, Examples, and How One Is Used
Tunica Inmate Roster Release
Fool's Paradise Showtimes Near Roxy Stadium 14
Vérificateur De Billet Loto-Québec
Petra Gorski Obituary (2024)
Embry Riddle Prescott Academic Calendar
Quest Diagnostics Mt Morris Appointment
Definition of WMT
53 Atms Near Me
Gummy Bear Hoco Proposal
Campaign Blacksmith Bench
Immobiliare di Felice| Appartamento | Appartamento in vendita Porto San
Ihop Deliver
Latest Posts
Article information

Author: Tuan Roob DDS

Last Updated:

Views: 5479

Rating: 4.1 / 5 (62 voted)

Reviews: 93% of readers found this page helpful

Author information

Name: Tuan Roob DDS

Birthday: 1999-11-20

Address: Suite 592 642 Pfannerstill Island, South Keila, LA 74970-3076

Phone: +9617721773649

Job: Marketing Producer

Hobby: Skydiving, Flag Football, Knitting, Running, Lego building, Hunting, Juggling

Introduction: My name is Tuan Roob DDS, I am a friendly, good, energetic, faithful, fantastic, gentle, enchanting person who loves writing and wants to share my knowledge and understanding with you.