Skip to article frontmatterSkip to article content

Lesson 1. Getting Started

The Ohio State University Libraries

This lesson provides an introduction to the Tableau workspace and explores effective strategies for finding and critically assessing publicly available data. Readings also offer guidance on understanding audience needs prior to data visualization and outline key principles for creating more impactful visual representations.

Data skills | concepts

  • Tableau
  • Finding data
  • Reading data

Learning objectives

  1. Apply the DRAMA Framework to critically evaluate a dataset.
  2. Navigate the Tableau start page, data source page, and workspace.

This tutorial is designed to support a multi-session Tableau for Research workshop hosted by The Ohio State University Libraries Research Commons. It is intended to help the ABSOLUTE beginner, or anyone who is relatively new to Tableau to build the skills and confidence to apply Tableau to research projects.

LESSON 1

Finding and/or obtaining data

Data is everywhere. Data for examples and activities in this tutorial was gathered from DataOhio, data.gov, and Wikipedia. Open data—defined as any data that “can be freely used, re-used, and redistributed by anyone, subject only, at most, to the requirement to attribute and share alike”—serves as a valuable resource for developing data literacy. Dietrich et al. (n.d.) While government agencies are primary producers of open data, it is also commonly shared by nonprofit organizations, independent researchers, and data enthusiasts. Enagaging with datasets that hold personal relevance can significantly enhance the learning experience, particularly when developing skills in data analysis and visualization. You are encouraged to download a dataset that is personally meaningful to you to use with the practices activities provided in this tutorial.

University Libraries offers several guides to help you Find Data. You can also talk with a librarian about your unique research data needs. Collecting research data on your own requires an understanding of acceptable methods for rigorously and ethically collecting data for your discipline. You can find current methodology for your discipline by talking to experts in your field, searching the literature, and finding ebooks, journals, and more the library catalog.

If you need data for a project, chances are your data may already be collected by a federal, state, or local governmental entity. Major publishers of U.S. government data include:

Reading data

Reading data effectively requires us to slow down and critically evaluate a dataset to understand its origins, who the dataset was created by and why, how the data was collected, and more. Data is not inherently neutral and context matters. Data analysts have a responsibility to recognize and disclose potential data biases that may distort visualizations and lead to misleading interpretations or narratives. A thorough examination of the dataset’s structure—including its dimensions (categorical variables) and measures (quantitative variables)—and a clear understanding of how these elements are defined, is fundamental to responsible and accurate data analysis.

The DRAMA framework

The DRAMA Framework is a helpful tool for critically evaluating data sources. Consider:

  • D - Date: When was the data last updated? Is it current? Does it reflect current trends?

  • R - Relevance: What procedures were used to collect the data? Is the data relevant to my research project? Did sampling procedures target the right audience or population? What was the context for collecting the data? Is there a description of the data set and what data it does and does not contain?

  • A - Accuracy: Is the data reliable? Valid? Were procedures for gathering the data followed consistently?

  • M - Motivation: Why was the data collected? Are there any potential biases in the data? Was any relevant data not included in the dataset? If yes, was this disclosed?

  • A - Authority: Who collected the data? an individual? a government agency? a business? or a political action committee? Are they credible? Primeau (n.d.)

The Tableau Environment

The Start page

When you open Tableau Desktop, you’ll land on the blue Start Page. Here’s what you’ll find:

  • Connect (Left Pane): Connect to your data sources. Connections to flat files, such as .xlsx, .csv, and .json documents are listed on the top. Direct connections to tables hosted on servers are listed below.
  • Center Pane: Open recently used workbooks.
  • Discover (Right Pane): Learn more about Tableau.
Tableau Desktop Start Page

With each new release, Tableau introduces new features and improvements. The Discover section is a valuable resource for staying informed and seeing how these features may support your ability to work with, analyze, and argue with data.

🌟 Make Learning Meaningful!!!

Let’s take a moment here to emphasize something important:

When learning data analysis and visualization, it’s incredibly helpful to work with data that matters to you.

  • 🎵 Love music? Try using a Wikipedia table listing albums or songs by your favorite artist.^
  • 🍽️ Foodie at heart? Explore recipes using the TheMealDB API.^~
  • 🏀 Into sports? Check out the curated Sports Data Sets from The Ohio State University Sports and Society Initiative.

Working with familiar or interesting data makes the learning process more engaging—and more fun!

The Data Source page

To get started with the Tableau Data Source page, we’ll use the Performers table from the Wikipedia page on Rock and Roll Hall of Fame inductees. The Performers category honors recording artists and bands who have had a significant and lasting impact on the development and legacy of rock and roll.^~

That said, you’re encouraged to use a dataset that’s personally meaningful to you! Feel free to substitute the example with your own data as you work through the practice activities in this tutorial.

To connect to rock_n_roll_performers.csv in your workshop materials.

On Tableau’s Start Page

  1. Go to the Connect pane on the left.
  2. Select Text file.
  3. Navigate to and open the CSV file.

This opens the Data Source Page.

On the Data Source page you’ll find:

  • The Connections pane on the left. Tableau Desktop allows multiple connections, which can be joined or related using common fields. For this tutorial, we will keep things simple and use only the rock_n_roll_performers.csv file.
  • The Canvas in the top center. The canvas displays the rock_n_roll_performers.csv file in a rectangle. Right-click on this rectangle or use the ▼ caret to:
    • Rename tables
    • Join and relate tables
    • Apply pre-filters to your data
  • The Metadata Grid in the bottom center. Here the data headers are displayed as rows. This feature is particularly helpful when you connect to a dataset with multiple tables and fields. The metadata grid allows you to:
    • Understand your data structure
    • Change data types
    • Hide unnecessary fields not required for your analysis
  • The Data Grid on the bottom right. The data grid shows the first 1,000 rows of data in your data source. In the data grid you can:
    • Pivot data
    • Create groups and bins
    • Build calculated fields

Dimensions vs. measures - Part 1

What is a measure?

Think of a measure as a variable used for math. Measures represent our quantitative data or units of measure.

What is a dimension?

Dimensions represent the qualitative data used to segment the measures.

Dimension or measure?

Dimensions sometimes function as measures. Age, for example, can be used to categorize data, or as a measure in a calculation. Measures sometimes function as dimensions. An identification number, for example, can represent a person, yet consist of all numbers. If the identification number is added sequentially to a database, it may also be used in a calculation.

example showing apple, banana, pear, and kiwi as a dimension in the fruit column and the number of apples, bananas, pears, and kiki in the number of fruit column

Figure 1:Dimensions and measures

Change data type - Option 1

The metadata grid for the rock_n_roll_performers.csv dataset primarily shows dimensions. The Index field represents an identification number and can serve as a dimension or measure. The Year field is currently recognized as a whole number. To change the data type of the Year field:

  1. Locate the YEAR field in the metadata grid.
  2. Right-click on the number icon before the field name.
  3. Select Date & Time from the menu.

Table 1:Performers Metadata Grid

TypeField NamePhysical TableRemote Field Name
Performers Metadata GridNumber (whole)Indexrock_n_roll_performers.csvindex
Performers Metadata GridDate & TimeYearrock_n_roll_performers.csvyear
Performers Metadata GridStringImagerock_n_roll_performers.csvimage
Performers Metadata GridStringNamerock_n_roll_performers.csvname
Performers Metadata GridStringInducted Membersrock_n_roll_performers.csvinducted_members
Performers Metadata GridStringPrior Nominationsrock_n_roll_performers.csvprior_nominations
Performers Metadata GridStringInduction Presenterrock_n_roll_performers.csvinduction_presenter
Performers Metadata GridStringArtistrock_n_roll_performers.csvartist
Performers Metadata GridStringImage Urlrock_n_roll_performers.csvimage_url
Performers Metadata GridStringArtist Urlrock_n_roll_performers.csvartist_url

Wide vs. tall data

Sometimes to efficiently and effectively analyze and/or visualize data we must restructure our data from wide to tall format. In Figure 1 above, each individual is represented by a single row, with separate columns for name, age, and number of visits. To transform this data into tall format, each individual would have a row for each measure. This structure includes:

  • A column for the measure name (e.g., “Age”, “Visits”)
  • A column for the measure values (e.g., 34, 5)
example showing data from figure 1 reconfigured in tall format

Figure 2:Tall data

For many Tableau projects, converting data to a tall format can enhance analysis and visualization.


^ Visit the Websites and APIs. Lesson 3. Wikipedia tutorial to learn how to extract tables from HTML using pandas.read_html.
~ See the Websites and APIs. Lesson 4. iCite tutorial and Websites and APIs. Lesson 7. Crossref tutorial to learn how to use APIs to gather data.

The Tableau Workspace

To access the Tableau workspace, select the Sheet1 tab at the bottom of workbook.

  • The Data Pane and the Analytics Pane are located in the Side Bar on the left.
  • The Marks Card is to the right of the Side Bar.
  • The Filters Shelf is above the Marks Card.
  • Dimensions and measures are placed on the columns and rows Shelves.
  • The data visualization is designed in the View.

Dimensions vs. measures - Part 2

When you connect to a data source, Tableau automatically categorizes each field as either a dimension or a measure. In the Data Pane, dimensions appear above the gray line, while measures are listed below it. If Tableau classifies a dimension as a measure, you can easily correct it by dragging the field above the gray line into the dimension area.

Change data type - Option 2

To change the data type of a field in the Tableau Workspace:

  1. Locate the field in the data pane.
  2. Right-click on the data type icon before the field name.
  3. Select the desired data type from the menu.

Toggle between workspace, data source and start page

locationToggles between
Data Source tabbottom leftTableau Workspace and Data Source
Small Tableau logo whitetop leftData Source and Start Page
Small Tableau logotop leftStart Page and Data Source
Sheet 1bottom leftData Source and Tableau Workspace

Supplemental readings

Storytelling with Data

Decorative book cover

Storytelling with Data
by Cole Nussbaumer Knaflic
Hoboken, New Jersey: Wiley, 2015.

Chapter 1: “The Importance of Context.”

Better Data Visualizations

Decorative book cover

Better Data Visualizations
by Jonathan Schwabish
New York: Columbia University Press, 2021.

Chapter 2: “Five Guidelines for Better Data Visualizations.”

References
  1. Dietrich, D., Gray, J., McNamara, T., Poikola, A., Pollock, R., Tait, J., & Zijlstra, T. (n.d.). Open Data Handbook. https://opendatahandbook.org/guide/en/
  2. Primeau, H. (n.d.). Taking the Drama Out of Source Evaluation. In Librarian Yarns. https://www.librarianyarns.com/drama-source-eval