Data Science for Studying Language and the Mind
2024-08-27
Follow along on the syllabus!
When you arrive, complete this anonymous form: Who’s in class
You can also join the waitlist if you are not enrolled
Data Sci for Lang & Mind is an entry-level course designed to teach basic principles of statistics and data science to students with little or no background in statistics or computer science. Students will learn to identify patterns in data using visualizations and descriptive statistics; make predictions from data using machine learning and optimization; and quantify the certainty of their predictions using statistical models. This course aims to help students build a foundation of critical thinking and computational skills that will allow them to work with data in all fields related to the study of the mind (e.g. linguistics, psychology, philosophy, cognitive science, neuroscience).
There are no prerequisites beyond high school algebra. No prior programming or statistics experience is necessary, though you will still enjoy this course if you already have a little. Students who have taken several computer science or statistics classes should look for a more advanced course.
Instructor: Dr. Katie Schuler (she/her)
TAs:
You can call me Professor Schuler or Katie, whichever makes you more comfortable
I live in Mt Airy with my husband and two kids (Dory, 2 and Joan, 6)
At Penn I also have a research lab, the Child Language Lab and am on the Natural Science and Math Panel (a group focused on improving inclusive teaching in STEM at Penn).
I’m a first-generation college student from Western NY. I worked 40 hours a week to put myself through college; I am still paying off my student loans.
You are an honest, kind, and hardworking student who wants to do well in and enjoy this class
You are very busy, and will sometimes have to prioritize other things above this class.
Data science is about making decisions based on incomplete information.
This concept is not new. Brains were built for doing this!
The folks who wrote R for Data Science proposed the following data science workflow:
We will spend the first few weeks getting comfortable programming in R, including some useful skills for data science:
Then, we will spend the next several weeks building a foundation in basic statistics and model building:
Finally we will cover a selection of more advanced topics that are often applied in language and mind fields, with a focus on basic understanding:
Each week will include two lectures and a lab:
There are 8 graded assessments:
There are a few policies to take note of:
In addition to our course website, we will use the following:
Please consider using these Penn resources this semester:
With many programming languages available for data science (e.g. R, Python, Julia, MATLAB), why use R?
Google Colab officially supports Python, but secretly supports R (and Julia, too!)
Update 2024: Google Colab now officially supports R!
Google colab demo
File > New notebook
and then Runtime
> Change runtime type
to R+ Code
- write and execute code+ Text
- write text blocks in markdownTable of contents
- outline from text headingsFind and replace
- find and/or replaceFiles
- upload files to cloud sessionFile > Locate in Drive
- where in your Google Drive?File > Save
- savesFile > Revision history
- history of changes you madeFile > Download > Download .ipynb
- used to submit assignments!File > Print
- printsRuntime > Run all
- run all cellsRuntime > Run before
- run all cells before current active cellRuntime > Restart and run all
- restart runtime, then run allCmd/Ctrl+S
- saveCmd/Ctrl+Enter
- run focused cellCmd/Ctrl+Shift+A
- select all cellsCmd/Ctrl+/
- comment/uncomment selectionCmd/Ctrl+]
- increase indentCmd/Ctrl+[
- decrease indentWe begin by defining some basic concepts:
10
- a simple value expression that evaluates to 10
.x <- 10
- an expression that assigns the value of 10
to x
.x + 10
- an expression that adds the value of x
to 10
.a <- x + 10
- an expression that adds the value of x
to 10
and assigns the result to the variable a
Looking forward to a great semester.