Introduction to Python, Pandas, and Jupyter Notebook for Analysis of All of Us Data

October 16–November 6, 2023 (asynchronous with office hours)

This asynchronous course introduces learners to the data analytics package Pandas in Python and the use of this system in Jupyter notebooks. Learners unfamiliar with Python will have the opportunity to familiarize themselves with the basics, while others can move straight into data import, manipulation, and analysis with Pandas.

Pandas provides a simple programmatic way to handle spreadsheet-like data, while allowing for advanced scripting to automate complex tasks. This course helps learners master these basics and lays a foundation from which a participant can build toward more specialized goals.

By the end of this course, learners will be able to use Python and Pandas to do the following:

    • Import data from a variety of formats and reorganize it
    • Clean, filter, and manipulate data
    • Run simple analyses

Learners unfamiliar with Python can also learn the basics of the language.

Meet the Instructor

Dr. Eric Butler is an ecologist and Associate Processor of Biology at Shaw University in Raleigh, NC. He began using Python in graduate school when he had to analyze a data set with 250,000 rows and taught himself to program. Since then, he’s written code to analyze color patterns, simulate inheritance and evolutionary processes, create interactive demonstrations for his classes, and analyze large data sets such as eBird.

About the All of Us Researcher Academy

The All of Us Researcher Academy is a comprehensive program that provides training and technical assistance for researchers conducting research with the All of Us Researcher Workbench, the cloud-based analytics platform where registered researchers can access data from All of Us participants. The academy resources are available at no cost to all registered researchers based on availability. The academy also supports peer-to-peer learning and network-building among researchers from institutions that are underrepresented in the biomedical workforce. Currently, the All of Us Researcher Academy is engaging with students, faculty, and post-docs at Historically Black Colleges and Universities (HBCUs) and Hispanic-Serving Institutions (HSIs). HBCUs are institutions described in section 4(b) of Executive Order 14041, incorporating by reference the institutions listed in 34 C.F.R. 608.2]. An HSI is defined as an institution of higher education that is an eligible institution and has an enrollment of an undergraduate full-time equivalent of students that is at least 25 percent Hispanic students at the end of the award year immediately preceding the date of application (20 U.S. Code §1101a).

The All of Us Researcher Academy is supported by the Division of Engagement and Outreach, All of Us Research Program, National Institutes of Health, Award Number 1OT20D028395-01.