add lectures 26, 27, 28

48651679 · Ashwin Maran · 01353f34 · 48651679 · 48651679 · 48651679
Commit 48651679 authored 1 year ago by Ashwin Maran
--- a/s24/AmFam_Ashwin/26_Files_and_Directories/26_Files_and_Directories.pdf
+++ b/s24/AmFam_Ashwin/26_Files_and_Directories/26_Files_and_Directories.pdf
--- a/s24/AmFam_Ashwin/26_Files_and_Directories/26_Files_and_Directories.pptx
+++ b/s24/AmFam_Ashwin/26_Files_and_Directories/26_Files_and_Directories.pptx
--- a/s24/AmFam_Ashwin/26_Files_and_Directories/Lecture Code/Lec26_Files_and_Directories_Solution.ipynb
+++ b/s24/AmFam_Ashwin/26_Files_and_Directories/Lecture Code/Lec26_Files_and_Directories_Solution.ipynb
--- a/s24/AmFam_Ashwin/26_Files_and_Directories/Lecture Code/Lec26_Files_and_Directories_Template.ipynb
+++ b/s24/AmFam_Ashwin/26_Files_and_Directories/Lecture Code/Lec26_Files_and_Directories_Template.ipynb
--- a/s24/AmFam_Ashwin/27_Pandas-1/Lecture Code/Lec27_Pandas1_Solution.ipynb
+++ b/s24/AmFam_Ashwin/27_Pandas-1/Lecture Code/Lec27_Pandas1_Solution.ipynb
--- a/s24/AmFam_Ashwin/27_Pandas-1/Lecture Code/Lec27_Pandas1_Template.ipynb
+++ b/s24/AmFam_Ashwin/27_Pandas-1/Lecture Code/Lec27_Pandas1_Template.ipynb
+{
+ "cells": [
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "## Warmup 0: Importing Pandas!"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "import pandas as pd"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "## Warmup 1: Find the mean, median, mode, and standard deviation of the following list of scores"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "my_scores = [44, 32, 19, 67, 23, 23, 92, 47, 47, 78, 84]"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "# write your code here"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "## Learning Objectives\n",
+    "- Create a pandas **Series** from a **list** or from a **dict**,\n",
+    "- Use **Series** methods `max`, `min`, `mean`, `median`, `mode`, `quantile`, `value_counts`,\n",
+    "- Extract elements from a **Series** using **Boolean indexing**,\n",
+    "- Access **Series** members using `.loc`, `.iloc`, `.items`, and slicing,\n",
+    "- Perform **Series** element-wise operations"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "# Pandas"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "**What is Pandas?**\n",
+    " - Pandas is a package of tools for doing Data Science\n",
+    " - Pandas was installed with Anaconda, so its on your computers\n",
+    " - [Learn More](https://en.wikipedia.org/wiki/Pandas_(software))\n",
+    " \n",
+    "If for some reason, you don't have pandas installed, run the following command in terminal or powershell...\n",
+    "<pre>pip install pandas</pre>"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "A Pandas Series is like a combination of a list and a dictionary. The word 'index' is used to describe position.\n"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "## Series from a `list`"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "scores = pd.Series([44, 32, 19, 67, 23, 23, 92, 47, 47, 78, 84])\n",
+    "scores"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "A Pandas series acts a lot like a list; you can index and slice."
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "scores[3]"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "scores[3:6]"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "### Series calculations: mean, median, mode, quartiles, sd, count"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "#### `mean`, `median`, and `std` return the mean, median, and standard deviation"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "print(scores.mean())\n",
+    "print(scores.median())\n",
+    "print(scores.std())"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "#### There could be multiple modes, so `mode` returns a Series"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "print(scores.mode())"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "#### `quantile` returns a Series of the numbers at each specified quantile"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "print(scores.quantile([1.0, 0.75, 0.5, 0.25, 0]))"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "print(scores.quantile([0.9, 0.1]))"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "#### `value_counts` creates a series where the index is the data, and the value is its count in the series"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "ages = pd.Series([18, 19, 20, 20, 20, 17, 18, 24, 25, 35, 22, 20, 21, 21, 20, 23, 23, 19, 19, 19, 20, 21])\n",
+    "ages.value_counts()"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "#### A series can be sorted by index or by values"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "ages.value_counts().sort_index()"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "ages.value_counts().sort_values(ascending=False)"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "### Plotting"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "## Series bar chart"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "age_plot = ages.value_counts().sort_index().plot.bar(color='lightsalmon')\n",
+    "age_plot.set(xlabel=\"age\", ylabel=\"count\")"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "# Filtering"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "## Example 1: What ages are at least 21?"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "at_least_21 = ages[ages >= 21]\n",
+    "at_least_21"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "## Exercise 1: What ages are exactly 18?"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "# write your code here"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "## Advanced Filtering\n",
+    " - `&` means `and`\n",
+    " - `|` means `or`\n",
+    " - `~` means `not`\n",
+    " - we must use `()` for compound boolean expressions"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "## Example 2: What ages are in the range 18 to 20, inclusive?"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "certain_students = ages[(ages >= 18) & (ages <= 20)]\n",
+    "certain_students"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "## Exercise 2: What percentage of students are in this age range?"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "# write your code here"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "## Exercise 3: What percentage of students are ages 18 OR 21?"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "# write your code here"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "## Exercise 4: What percentage of students are NOT 19?"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "# write your code here"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "#### One more thing....\n",
+    "\n",
+    "We can perform an operation on all values in a Series"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "## Example 3: Add 1 to everyone's age"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "ages += 1\n",
+    "ages.value_counts()"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "# Using a Series to store Pokemon stats"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "# Modified from https://automatetheboringstuff.com/chapter14/\n",
+    "import csv\n",
+    "def process_csv(filename):\n",
+    "    example_file = open(filename, encoding=\"utf-8\")\n",
+    "    example_reader = csv.reader(example_file)\n",
+    "    example_data = list(example_reader)\n",
+    "    example_file.close()\n",
+    "    return example_data\n",
+    "\n",
+    "data = process_csv(\"pokemon_stats.csv\")\n",
+    "header = data[0]\n",
+    "print(len(data))\n",
+    "data = data[1:]\n",
+    "data[15:18]"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "## Example 4: Create a Series of all the Pokemon names"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "pokemon_list = [row[1] for row in data]\n",
+    "pokemon_names = pd.Series(pokemon_list)\n",
+    "pokemon_names"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "## Exercise 5: Create a Series of all the Pokemon HPs"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "# write your code here"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "## Exercise 6: Find the most common HP"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "# write your code here"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "## Exercise 7: Find how many Pokemon have that most common HP"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "# write your code here "
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "## Exercise 8: How many Pokemon have HP between 50 and 75 (inclusive)?"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "# write your code here "
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "## Example 5: What are the names of weak Pokemon (`< 30` HP)?"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "weak_hps_idx = hps[hps < 30].index\n",
+    "pokemon_names[weak_hps_idx]"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "## Exercise 9: What are the names of the Pokemon from strongest to weakest (using HP)?"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "# write your code here"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "## Series from a `dict`\n",
+    "A Series is a cross between a list and a dict, so we can make a series from a dict as well"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "game1_points = pd.Series({\"Chris\": 10, \"Kiara\": 3, \"Mikayla\": 7, \"Ann\": 8, \"Trish\": 6})\n",
+    "print(game1_points)"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "game2_points = pd.Series({\"Kiara\": 7, \"Chris\": 3,  \"Trish\": 11, \"Mikayla\": 2, \"Ann\": 5})\n",
+    "print(game2_points)"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "#### Pandas can perform operations on two series by matching up their indices"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "total = game1_points  + game2_points\n",
+    "total"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "## Example 6: Who has the most points in total?"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "print(total.max())\n",
+    "print(total.idxmax())"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "#### We can use `[]` to index the name"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "total['Kiara']"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "#### We can also use `[]` to index by the sequence number, but this should be avoided, and this feature will not be available in future versions of Pandas"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "total[2]"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "#### We can have multi-indexing, slightly different from slicing"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "total[[\"Chris\", \"Trish\"]]"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "### More plotting:"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "total_sorted = total.sort_values(ascending=False)\n",
+    "total_sorted"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "ax = total_sorted.plot.bar(color=\"green\", fontsize=16)\n",
+    "ax.set_ylabel(\"total points\", fontsize=16)"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "## More things to know about Series\n",
+    "Next, we'll get into more ways to access data using `loc` and `iloc`."
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "game1_points"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "game1_points.iloc[2] # looks up by integer position"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "game1_points.loc[\"Mikayla\"] # looks up by pandas index"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "my_new_series = pd.Series({1: 89, 2: 104, 3: 681}) # this can be tricky!\n",
+    "my_new_series"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "my_new_series.iloc[1] # by integer position"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "my_new_series.loc[1] # by index"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "my_new_series[1] # by index!"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "my_new_series[my_new_series > 100] # ... and also boolean indexing!"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "Feel overwhelmed? Do the required reading."
+   ]
+  }
+ ],
+ "metadata": {
+  "kernelspec": {
+   "display_name": "Python 3 (ipykernel)",
+   "language": "python",
+   "name": "python3"
+  },
+  "language_info": {
+   "codemirror_mode": {
+    "name": "ipython",
+    "version": 3
+   },
+   "file_extension": ".py",
+   "mimetype": "text/x-python",
+   "name": "python",
+   "nbconvert_exporter": "python",
+   "pygments_lexer": "ipython3",
+   "version": "3.11.7"
+  }
+ },
+ "nbformat": 4,
+ "nbformat_minor": 4
+}
+%% Cell type:markdown id: tags:
+
+## Warmup 0: Importing Pandas!
+
+%% Cell type:code id: tags:
+
+``` python
+import pandas as pd
+```
+
+%% Cell type:markdown id: tags:
+
+## Warmup 1: Find the mean, median, mode, and standard deviation of the following list of scores
+
+%% Cell type:code id: tags:
+
+``` python
+my_scores = [44, 32, 19, 67, 23, 23, 92, 47, 47, 78, 84]
+```
+
+%% Cell type:code id: tags:
+
+``` python
+# write your code here
+```
+
+%% Cell type:markdown id: tags:
+
+## Learning Objectives
+- Create a pandas **Series** from a **list** or from a **dict**,
+- Use **Series** methods `max`, `min`, `mean`, `median`, `mode`, `quantile`, `value_counts`,
+- Extract elements from a **Series** using **Boolean indexing**,
+- Access **Series** members using `.loc`, `.iloc`, `.items`, and slicing,
+- Perform **Series** element-wise operations
+
+%% Cell type:markdown id: tags:
+
+# Pandas
+
+%% Cell type:markdown id: tags:
+
+**What is Pandas?**
+ - Pandas is a package of tools for doing Data Science
+ - Pandas was installed with Anaconda, so its on your computers
+ - [Learn More](https://en.wikipedia.org/wiki/Pandas_(software))
+
+If for some reason, you don't have pandas installed, run the following command in terminal or powershell...
+<pre>pip install pandas</pre>
+
+%% Cell type:markdown id: tags:
+
+A Pandas Series is like a combination of a list and a dictionary. The word 'index' is used to describe position.
+
+%% Cell type:markdown id: tags:
+
+## Series from a `list`
+
+%% Cell type:code id: tags:
+
+``` python
+scores = pd.Series([44, 32, 19, 67, 23, 23, 92, 47, 47, 78, 84])
+scores
+```
+
+%% Cell type:markdown id: tags:
+
+A Pandas series acts a lot like a list; you can index and slice.
+
+%% Cell type:code id: tags:
+
+``` python
+scores[3]
+```
+
+%% Cell type:code id: tags:
+
+``` python
+scores[3:6]
+```
+
+%% Cell type:markdown id: tags:
+
+### Series calculations: mean, median, mode, quartiles, sd, count
+
+%% Cell type:markdown id: tags:
+
+#### `mean`, `median`, and `std` return the mean, median, and standard deviation
+
+%% Cell type:code id: tags:
+
+``` python
+print(scores.mean())
+print(scores.median())
+print(scores.std())
+```
+
+%% Cell type:markdown id: tags:
+
+#### There could be multiple modes, so `mode` returns a Series
+
+%% Cell type:code id: tags:
+
+``` python
+print(scores.mode())
+```
+
+%% Cell type:markdown id: tags:
+
+#### `quantile` returns a Series of the numbers at each specified quantile
+
+%% Cell type:code id: tags:
+
+``` python
+print(scores.quantile([1.0, 0.75, 0.5, 0.25, 0]))
+```
+
+%% Cell type:code id: tags:
+
+``` python
+print(scores.quantile([0.9, 0.1]))
+```
+
+%% Cell type:markdown id: tags:
+
+#### `value_counts` creates a series where the index is the data, and the value is its count in the series
+
+%% Cell type:code id: tags:
+
+``` python
+ages = pd.Series([18, 19, 20, 20, 20, 17, 18, 24, 25, 35, 22, 20, 21, 21, 20, 23, 23, 19, 19, 19, 20, 21])
+ages.value_counts()
+```
+
+%% Cell type:markdown id: tags:
+
+#### A series can be sorted by index or by values
+
+%% Cell type:code id: tags:
+
+``` python
+ages.value_counts().sort_index()
+```
+
+%% Cell type:code id: tags:
+
+``` python
+ages.value_counts().sort_values(ascending=False)
+```
+
+%% Cell type:markdown id: tags:
+
+### Plotting
+
+%% Cell type:markdown id: tags:
+
+## Series bar chart
+
+%% Cell type:code id: tags:
+
+``` python
+age_plot = ages.value_counts().sort_index().plot.bar(color='lightsalmon')
+age_plot.set(xlabel="age", ylabel="count")
+```
+
+%% Cell type:markdown id: tags:
+
+# Filtering
+
+%% Cell type:markdown id: tags:
+
+## Example 1: What ages are at least 21?
+
+%% Cell type:code id: tags:
+
+``` python
+at_least_21 = ages[ages >= 21]
+at_least_21
+```
+
+%% Cell type:markdown id: tags:
+
+## Exercise 1: What ages are exactly 18?
+
+%% Cell type:code id: tags:
+
+``` python
+# write your code here
+```
+
+%% Cell type:markdown id: tags:
+
+## Advanced Filtering
+ - `&` means `and`
+ - `|` means `or`
+ - `~` means `not`
+ - we must use `()` for compound boolean expressions
+
+%% Cell type:markdown id: tags:
+
+## Example 2: What ages are in the range 18 to 20, inclusive?
+
+%% Cell type:code id: tags:
+
+``` python
+certain_students = ages[(ages >= 18) & (ages <= 20)]
+certain_students
+```
+
+%% Cell type:markdown id: tags:
+
+## Exercise 2: What percentage of students are in this age range?
+
+%% Cell type:code id: tags:
+
+``` python
+# write your code here
+```
+
+%% Cell type:markdown id: tags:
+
+## Exercise 3: What percentage of students are ages 18 OR 21?
+
+%% Cell type:code id: tags:
+
+``` python
+# write your code here
+```
+
+%% Cell type:markdown id: tags:
+
+## Exercise 4: What percentage of students are NOT 19?
+
+%% Cell type:code id: tags:
+
+``` python
+# write your code here
+```
+
+%% Cell type:markdown id: tags:
+
+#### One more thing....
+
+We can perform an operation on all values in a Series
+
+%% Cell type:markdown id: tags:
+
+## Example 3: Add 1 to everyone's age
+
+%% Cell type:code id: tags:
+
+``` python
+ages += 1
+ages.value_counts()
+```
+
+%% Cell type:markdown id: tags:
+
+# Using a Series to store Pokemon stats
+
+%% Cell type:code id: tags:
+
+``` python
+# Modified from https://automatetheboringstuff.com/chapter14/
+import csv
+def process_csv(filename):
+    example_file = open(filename, encoding="utf-8")
+    example_reader = csv.reader(example_file)
+    example_data = list(example_reader)
+    example_file.close()
+    return example_data
+
+data = process_csv("pokemon_stats.csv")
+header = data[0]
+print(len(data))
+data = data[1:]
+data[15:18]
+```
+
+%% Cell type:markdown id: tags:
+
+## Example 4: Create a Series of all the Pokemon names
+
+%% Cell type:code id: tags:
+
+``` python
+pokemon_list = [row[1] for row in data]
+pokemon_names = pd.Series(pokemon_list)
+pokemon_names
+```
+
+%% Cell type:markdown id: tags:
+
+## Exercise 5: Create a Series of all the Pokemon HPs
+
+%% Cell type:code id: tags:
+
+``` python
+# write your code here
+```
+
+%% Cell type:markdown id: tags:
+
+## Exercise 6: Find the most common HP
+
+%% Cell type:code id: tags:
+
+``` python
+# write your code here
+```
+
+%% Cell type:markdown id: tags:
+
+## Exercise 7: Find how many Pokemon have that most common HP
+
+%% Cell type:code id: tags:
+
+``` python
+# write your code here
+```
+
+%% Cell type:markdown id: tags:
+
+## Exercise 8: How many Pokemon have HP between 50 and 75 (inclusive)?
+
+%% Cell type:code id: tags:
+
+``` python
+# write your code here
+```
+
+%% Cell type:markdown id: tags:
+
+## Example 5: What are the names of weak Pokemon (`< 30` HP)?
+
+%% Cell type:code id: tags:
+
+``` python
+weak_hps_idx = hps[hps < 30].index
+pokemon_names[weak_hps_idx]
+```
+
+%% Cell type:markdown id: tags:
+
+## Exercise 9: What are the names of the Pokemon from strongest to weakest (using HP)?
+
+%% Cell type:code id: tags:
+
+``` python
+# write your code here
+```
+
+%% Cell type:markdown id: tags:
+
+## Series from a `dict`
+A Series is a cross between a list and a dict, so we can make a series from a dict as well
+
+%% Cell type:code id: tags:
+
+``` python
+game1_points = pd.Series({"Chris": 10, "Kiara": 3, "Mikayla": 7, "Ann": 8, "Trish": 6})
+print(game1_points)
+```
+
+%% Cell type:code id: tags:
+
+``` python
+game2_points = pd.Series({"Kiara": 7, "Chris": 3,  "Trish": 11, "Mikayla": 2, "Ann": 5})
+print(game2_points)
+```
+
+%% Cell type:markdown id: tags:
+
+#### Pandas can perform operations on two series by matching up their indices
+
+%% Cell type:code id: tags:
+
+``` python
+total = game1_points  + game2_points
+total
+```
+
+%% Cell type:markdown id: tags:
+
+## Example 6: Who has the most points in total?
+
+%% Cell type:code id: tags:
+
+``` python
+print(total.max())
+print(total.idxmax())
+```
+
+%% Cell type:markdown id: tags:
+
+#### We can use `[]` to index the name
+
+%% Cell type:code id: tags:
+
+``` python
+total['Kiara']
+```
+
+%% Cell type:markdown id: tags:
+
+#### We can also use `[]` to index by the sequence number, but this should be avoided, and this feature will not be available in future versions of Pandas
+
+%% Cell type:code id: tags:
+
+``` python
+total[2]
+```
+
+%% Cell type:markdown id: tags:
+
+#### We can have multi-indexing, slightly different from slicing
+
+%% Cell type:code id: tags:
+
+``` python
+total[["Chris", "Trish"]]
+```
+
+%% Cell type:markdown id: tags:
+
+### More plotting:
+
+%% Cell type:code id: tags:
+
+``` python
+total_sorted = total.sort_values(ascending=False)
+total_sorted
+```
+
+%% Cell type:code id: tags:
+
+``` python
+ax = total_sorted.plot.bar(color="green", fontsize=16)
+ax.set_ylabel("total points", fontsize=16)
+```
+
+%% Cell type:markdown id: tags:
+
+## More things to know about Series
+Next, we'll get into more ways to access data using `loc` and `iloc`.
+
+%% Cell type:code id: tags:
+
+``` python
+game1_points
+```
+
+%% Cell type:code id: tags:
+
+``` python
+game1_points.iloc[2] # looks up by integer position
+```
+
+%% Cell type:code id: tags:
+
+``` python
+game1_points.loc["Mikayla"] # looks up by pandas index
+```
+
+%% Cell type:code id: tags:
+
+``` python
+my_new_series = pd.Series({1: 89, 2: 104, 3: 681}) # this can be tricky!
+my_new_series
+```
+
+%% Cell type:code id: tags:
+
+``` python
+my_new_series.iloc[1] # by integer position
+```
+
+%% Cell type:code id: tags:
+
+``` python
+my_new_series.loc[1] # by index
+```
+
+%% Cell type:code id: tags:
+
+``` python
+my_new_series[1] # by index!
+```
+
+%% Cell type:code id: tags:
+
+``` python
+my_new_series[my_new_series > 100] # ... and also boolean indexing!
+```
+
+%% Cell type:markdown id: tags:
+
+Feel overwhelmed? Do the required reading.
--- a/s24/AmFam_Ashwin/27_Pandas-1/Lecture Code/pokemon_stats.csv
+++ b/s24/AmFam_Ashwin/27_Pandas-1/Lecture Code/pokemon_stats.csv
--- a/s24/AmFam_Ashwin/28_Pandas-2/Lec_28_Pandas2_Worksheet.ipynb
+++ b/s24/AmFam_Ashwin/28_Pandas-2/Lec_28_Pandas2_Worksheet.ipynb
--- a/s24/AmFam_Ashwin/28_Pandas-2/Lecture Code/IMDB-Movie-Data.csv
+++ b/s24/AmFam_Ashwin/28_Pandas-2/Lecture Code/IMDB-Movie-Data.csv
--- a/s24/AmFam_Ashwin/28_Pandas-2/Lecture Code/Lec28_Pandas2_Solution.ipynb
+++ b/s24/AmFam_Ashwin/28_Pandas-2/Lecture Code/Lec28_Pandas2_Solution.ipynb
--- a/s24/AmFam_Ashwin/28_Pandas-2/Lecture Code/Lec28_Pandas2_Template.ipynb
+++ b/s24/AmFam_Ashwin/28_Pandas-2/Lecture Code/Lec28_Pandas2_Template.ipynb
--- a/s24/AmFam_Ashwin/28_Pandas-2/lec-28-worksheet.pdf
+++ b/s24/AmFam_Ashwin/28_Pandas-2/lec-28-worksheet.pdf
--- a/s24/AmFam_Ashwin/28_Pandas-2/storms.csv
+++ b/s24/AmFam_Ashwin/28_Pandas-2/storms.csv
+name,year,type,speed,place
+alice,2016,tornado,100,o
+bob,2016,hurricane,200,p
+cindy,2017,tornado,150,o
+dan,2018,tornado,300,o
+eve,2018,hurricane,250,a
\ No newline at end of file