Unverified Commit 719e352b authored by Simon Bowly's avatar Simon Bowly
Browse files

add revision exercises for ads1002

parent 26fea5c1
%% Cell type:markdown id:692d1a3a tags:
# Pandas Exercises
* These short exercises involve a combination of filtering, aggregating and joining data to test your knowledge.
* This is also a chance to make sure your notebook environment is working properly!
* Discuss in your group the different ways you could accomplish these tasks, and enter your answers as a group in the flux poll here: https://flux.qa/DSMGW7
* Take your time, and feel free to research/consult last semester's notes.
%% Cell type:code id:3f6fb082 tags:
``` python
import pandas as pd
# This dataframe represents a list of transactions from a very strange
# low-value auction. (I know, I'm not very creative).
sold_items = pd.DataFrame({
"sale_id": [1, 2, 3, 4, 5, 6, 7, 8, 9, 10],
"type": [
"book", "photo", "book", "pen", "book",
"pen", "pen", "book", "pen", "pen"
],
"price": [
10, 5, 9, 1, 7, 0.9, 0.5, 15, 1.2, 0.5,
]
})
sold_items
```
%%%% Output: execute_result
sale_id type price
0 1 book 10.0
1 2 photo 5.0
2 3 book 9.0
3 4 pen 1.0
4 5 book 7.0
5 6 pen 0.9
6 7 pen 0.5
7 8 book 15.0
8 9 pen 1.2
9 10 pen 0.5
%% Cell type:markdown id:8bcebc54 tags:
Q1. Find a single line of code to count the number of items of each type (books, pens, photos) sold.
%% Cell type:code id:7fda3a5e tags:
``` python
```
%% Cell type:markdown id:e33f33c4 tags:
Q2. Find a single line of code to calculate the total value of all transactions.
%% Cell type:code id:b273da27 tags:
``` python
```
%% Cell type:markdown id:fdbf47d3 tags:
Q3. Find a single line of code to calculate some statistic (e.g. mean, max, std.dev) of the sale price of a pen in this auction.
%% Cell type:code id:648c4ec1 tags:
``` python
```
%% Cell type:code id:f8cb0c0c tags:
``` python
# This dataframe contains prices at which the auctioneer bought each item type.
bought_items = pd.DataFrame({
"type": ["book", "photo", "pen"],
"cost": [11, 6, 0.6],
})
bought_items
```
%%%% Output: execute_result
type cost
0 book 11.0
1 photo 6.0
2 pen 0.6
%% Cell type:markdown id:6a1e1490 tags:
Q4. Find a single line of code which creates a new dataframe with the 'bought' cost of each item included in the sold_items table.
%% Cell type:code id:81a34368 tags:
``` python
```
%% Cell type:markdown id:c28b0b63 tags:
Q5. Find a single line of code which returns a dataframe containing only the items for which the auctioneer made a profit.
%% Cell type:code id:a0cdad0d tags:
``` python
```
%% Cell type:markdown id:9a1ba051 tags:
Q6. Find code to recreate the histogram plot below.
%% Cell type:code id:068936d4 tags:
``` python
import seaborn as sns
```
%% Cell type:code id:d6199297 tags:
``` python
```
%%%% Output: execute_result
<AxesSubplot:xlabel='price', ylabel='Count'>
%%%% Output: display_data
![]()
%% Cell type:markdown id:b337513f tags:
Q7. Find code to recreate the scatter plot below.
%% Cell type:code id:79baeb3e tags:
``` python
```
%%%% Output: execute_result
<seaborn.axisgrid.FacetGrid at 0x7f4fd49b6dc0>
%%%% Output: display_data
![]()
Markdown is supported
0% or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment