Test

Test#

The test will contain fill-in-the-blank, matching, and free response questions, coding and high-level. It will be done on paper, closed-book (no cheat sheets, phones, laptops, etc).

Topics#

The test will cover the following:

  • Working with data using only the Python standard library (“pure Python”)

  • Python/pandas data types/structures

  • Jupyter concepts

  • Data manipulation

    • Boolean indexing

    • Grouping

    • Cleaning

    • Merging

  • Troubleshooting

  • Data visualization

  • Time series

    • Resampling

  • APIs, conceptually

Example questions#

Recommendation: Try completing them without using a search engine, generative AI, the course materials, etc.

  • What types can a pandas column be? Come up with at least three.

  • What are some scenarios where you’d need to convert between Python/pandas types? Come up with at least three.

  • How do you make a link in Markdown?

  • How does groupby() work?

  • What is resampling used for? Include an example.

  • What’s a kernel, in the context of this course?

  • Write the equivalent of the following in pandas.

    import csv
    
    lowest_gdp = None
    lowest_country = None
    
    with open("gdp.csv") as f:
       reader = csv.DictReader(f)
       for row in reader:
          gdp = float(row["GDP"])
          if lowest_gdp is None or gdp < lowest_gdp:
                lowest_gdp = gdp
                lowest_country = row["Country"]
    
    print(lowest_country)
    
  • What’s a choropleth map? Try explaining to someone who’s never seen one before.

  • What are the different kinds of merges? Can describe through text and/or visuals.

  • How would you merge the following Dataframes? Write the code as best you can.

intros

Name

Year introduced

Mickey Mouse

1928

Big Bird

1969

Lisa Simpson

1987

shows

First

Last

Series

Lisa

Simpson

The Simpsons

Mickey

Mouse

multiple

Big

Bird

Sesame Street