Docsity
Docsity

Prepare for your exams
Prepare for your exams

Study with the several resources on Docsity


Earn points to download
Earn points to download

Earn points by helping other students or get them with a premium plan


Guidelines and tips
Guidelines and tips

Business Analytics with Python Course at USC Marshall, Study notes of Calculus

The course DSO 459: Business Analytics with Python offered at USC Marshall in Spring 2022. The course aims to develop essential skills for working in a business analytics division within a company, including programming in Python, data analysis, building predictive models, and presenting results to business stakeholders. The course includes 11 deliverables, including homework assignments, case studies, group presentations, and a take-home case study. The midterm exam is both open notes and open book and involves both written and computer portions.

Typology: Study notes

2021/2022

Uploaded on 05/11/2023

millyx
millyx 🇺🇸

4.7

(9)

249 documents

1 / 19

Toggle sidebar

This page cannot be seen from the preview

Don't miss anything!

bg1
pf3
pf4
pf5
pf8
pf9
pfa
pfd
pfe
pff
pf12
pf13

Partial preview of the text

Download Business Analytics with Python Course at USC Marshall and more Study notes Calculus in PDF only on Docsity!

USC Marshall, Data Science and Operations

DSO 459: Business Analytics with Python

Spring 2022 - 4.0 Units

Instructor: Austin Pollok Time: T/Th 4-5:50pm E-mail: pollok@usc.edu Room: ACC 236

Course Description and Policies

Course Description:

In today’s business environment data is ubiquitous. Many companies can thrive or fail by the quality of their data analysts, software developers, and technology or analytics managers. Albeit a rather tall order, it is imperative for these positions to be able to interface with one other to de- rive insights, produce products, and implement data-driven business decisions. Companies such as Amazon, Walmart, Google, American Express, and Disney perform analytics in operations, marketing, product development and management, and strategic planning. This course is about developing the essential skills, through a hands-on perspective, for working in a business analytics division within a company, regardless of the precise position. Such skills include programming, often in Python, answering business questions through data analysis, building predictive models, and presenting results to business stakeholders. The course is intended to develop students with little to no prior programming experience and prepare them to go through an end-to-end analytics life cycle in the context of various business fields. Such a life cycle starts with formulating business objectives and hypotheses, exploring data quantitatively and visually, building models, interpreting the results, and finally communicating actionable recommendations to business stakeholders that add value. This will be accomplished through student participation in end-to-end business analytics and machine learning case studies and projects in Python. Due to the wide-variety of necessary skills required to be a successful entity within an analytics division, the course can be challenging.

Course Communication Policy:

Please communicate! I would like to unblock students as early as possible, meaning I want to clear any questions, confusions, or concerns. We can meet in office hours, over Zoom, on Slack, or in-person, as well as grabbing a coffee for a quick chat. I am very interested in having discussions with all students, whether it be regarding course material, extensions of course materials, programming, or general career discussions in the field of analytics. I would also appreciate hearing any feedback regarding the course, as it is in a state of continual evolution. Please email me to set up any times for discussion, I will typically respond to all emails within two business days, though commonly quicker. Any content specific questions should also be posted on the course Slack channel where I will respond for any students who may be having similar trouble.

Course Grading Policy:

The homework weight is set on a sliding-scale. The reason for this is homework at the beginning is essential to complete the later assignments; it is also assumed to be brand-new to students, and therefore students will be expected to make mistakes, which is essential for learning. The midterm is a check-point to make sure students are familiar with basic Python and data wrangling in Pandas. The presentation will be an opportunity for students to explore a field of interest for a future career in business analytics. The final is meant to simulate a take-home interview screen commonly given to applicants in business analytics or data science.

Assignment Weight

Basic Python HW 1 & 2 2.5% each

Data Wrangling HW 3 & 4 2.5% each

Midterm 15%

Analytics Case Studies HW 5, 6, 7, & 8 7.5% each

Group Presentation 10%

Final (Take-Home Case Study) 35%

Assignments and Grading Policies

Student Deliverables:

Students are expected to produce 11 deliverables by the end of the course.

  1. HW 1: Intro to Python - Assignment, control flow (Twitter JSON example I)...... ??
  2. HW 2: Intro to Python - Iteration, functions (Twitter JSON example II)........... ??
  3. HW 3: Data Wrangling - Data frame operations, aggregation (Uniswap data I)..... ??
  4. HW 4: Data Wrangling - Grouping, merging (Uniswap data II).................... ??
  5. Midterm: Basic Python and Data Wrangling............................ ??(in-class)
  6. HW 5: Scikit-Learn - Estimator API (Zillow California Housing Data).............. ??
  7. HW 6: Case study I (Target Corporation)....................................... ??
  8. HW 7: Case study II (The Home Depot)........................................ ??
  9. HW 8: Case study III (Optiver)................................................. ??
  10. Group Presentations.......................................................... ??
  11. Take-Home Case Study (Final Exam).......................................... ??

Assignment Descriptions and Criterion:

  1. Homework Assignments: Homework assignments are meant to give students a chance to explore a small scale business analytics problem on their own, or collaboratively, while practicing the methods, techniques, and coding skills developed in the more recent lectures. Homework case studies are intended to be an example of the types of analytics problems students might encounter in an interview or, more generally, problems you may encounter in your fist job out of Marshall. In particular, they will be follow up quesitons regarding the in- class lab assignments. Homework will be posted on Blackboard and are to be submitted to Blackboard by the respective due date. Homework will typically be an extention or completion of in-class labs.
  2. Midterm Exam: The midterm exam will be both open notes and open book. It will be cumulative and is meant to help gauge student comprehension of the material. It will involve both written and computer portions. There will be a review session in the class lecture before the scheduled exam date.
  3. Group Presentation: The purpose of this assignment is for students to research a field of data science and analytics which they have an interest in. At the beginning of the semester, I will ask

post anything related to the assignments online. Failure to abide by the above guidelines may constitute a case of suspected plagiarism or cheating, which will be reported and investigated by USC. Please see the “Academic Integrity and Conduct” section below for further details.

Course Schedule

Semester Outline:

  1. Course begins on Tuesday 1/11/22.
  2. Spring 2022 semester has 16 calendar weeks, one of which is lost for Spring recess, which leaves 15 weeks for lecture.
  3. Spring 2022 semester has two holidays apart from Spring recess, MLK day on Monday, 1/17/22, and President’s day on Monday, 2/21/22. Neither affect course lectures.
  4. From the 15 lecture weeks, 3 weeks will be devoted to a midterm, presentations, and a final review, leaving 12 weeks for new content.
  5. From the 12 new content weeks, 2 weeks are devoted to intro to Python, 3 weeks are devoted to Pandas and EDA, 5 (6) weeks are devoted to analytics, modeling and case studies, and 2 (1) weeks are reserved for spillover.
  6. From the 2 weeks of intro to Python, there will be 2 HW. From the 3 weeks of Pandas and EDA there will be 2 HW. From the 5 (6) weeks of analytics, modeling, and case studies, there will be 4 HW. A total of 8 HW will be assigned throughout the 12 weeks of new course content.
  7. Course ends on Thursday 4/28/22.

Tentative Course Outline: WORK IN PROGRESS

This outline is tentative and subject to change, including the case studies. It will be updated as the course progresses. Interesting links: There is a number of great websites and podcasts that provide additional interesting information. See for instance MIT Machine Learning News, MIT Big Data News, Hitchhiker’s Guide to Python, Towards Data Science, FiveThirtyEight, and Real Python.

First Module: Intro to Python

Week 1: Intro to Python - Twitter JSON data I This week will introduce the course structure, give an introduction to data science and analytics, and introduction to Python. Tuesday Lecture 1/ ∗ Sy llabus.ipy nb ∗ Intr o to Data Science.ipy nb Thursday Lab 1/ ∗ Intr o to P y thon.ipy nb ∗ Twitter JSON example

Week 2: Intro to Python - Twitter JSON data II:

We continue learning the basics of Python in the context of parsing Twitter data in JSON format.

Tuesday Lecture 1/ ∗ Thursday Lab 1/ ∗

CheatSheet  Python Cheat Sheet

Second Module: Intro to Pandas

Week 3: Pandas - Uniswap Case Study I:

This week we begin learning how to wrangle with data in Python’s data analysis package Pandas. We will apply our knowledge in the context of a blockchain-based decentralized exchange for pools of cryptocurrencies.

Tuesday Lecture 1/ ∗ Thursday Lab 1/ ∗ CheatSheet  Pandas Cheat Sheet I

Week 4: Pandas and EDA - Uniswap Case Study II

We’ll continue learning different aspects of Pandas, aggregation and grouping, and practice conducting exploratory data analysis to gain insights into the Uniswap pool facility data.

Tuesday Lecture 2/ ∗ Thursday Lab 2/ ∗ CheatSheet  Pandas Cheat Sheet II

Week 5: Pandas and EDA - Uniswap Case Study III

We will continue with Pandas, merging and concatenating data frames, and collect our insights from our exploratory data analysis with Uniswap data.

Tuesday Lecture 2/ ∗ Thursday Lab 2/

Third Module: Midterm on Intro to Python and Pandas

Week 6: Review and Midterm on Python and Pandas

We will reserve one session for basic Python and data wrangling with Pandas review. There will be an in-class midterm as well.

Tuesday Lecture 2/ ∗ Rev iew .ipy nb Thursday Lab 2/ ∗ Midter m.ipy nb CheatSheet

 Midterm Review

Fourth Module: Predictive Analytics Case Studies

Week 7: Intro to Scikit-Learn’s Estimator API - Zillow California Housing Data

Introduction to Scikit-Learn Estimator API and the general scaffolding of a machine learn- ing problem in the context of a Zillow housing price forecasting application.

Tuesday Lecture 2/ ∗ Thursday Lab 2/ ∗ CheatSheet

 Scikit-Learn Cheat Sheet I

Week 8: Previous Week Continued

We will continue investigating Zillow housing price data and formulating a business ana- lytics application into running Python code.

Tuesday Lecture 3/ ∗ Thursday Lab 3/ ∗

CheatSheet  Scikit-Learn Cheat Sheet II  Scikit-Learn Cheat Sheet III

Week 9: Walmart Store Sales Forecasting

We will cover the necessarily analytics tools to answer this week’s business application.

Tuesday Lecture 3/ ∗ Thursday Lab 3/

Fifth Module: Presentations, Review, Take-Home Final Case Study

Week 15: Presentations

We will have each group present their findings to the class, as well as share the mock interview on the course drive.

Tuesday Lecture 4/ ∗ Presentations I Thursday Lab 4/ ∗ Presentations II

Week 16: Course Review and Intro to GitHub

To wrap up the course, we will spend time reflecting on our case studies and the techniques we have studied to answer analytics questions, which are posed to add value for a company. Additionally, we will give a brief introduction to GitHub and Git to demonstrate how you might showcase your skills to potential employers.

Tuesday Lecture 4/ ∗ Thursday Lab 4/ ∗

Week 17: Take-Home Case Study (Final Exam)

Students will complete a case study at home and submit by the required deadline. This is an opportunity to apply the techniques learned throughout the course, as well as techniques you have learned on your own, and take ownership of a project you could potentially put on your GitHub and show off to employers.

University Scheduled Final Exam Day 5/5 - 4:30 to 6:30pm ∗ We will use this period to discuss the final take-home exam and how it might be used in future interviews.

USC Statement on Academic Conduct and Support Systems

Academic Conduct: Plagiarism – presenting someone else’s ideas as your own, either verbatim or recast in your own words – is a serious academic offense with serious consequences. Please familiarize yourself with the discussion of plagiarism in SCampus in Part B, Section 11, “Behavior Violating Univer- sity Standards”. Other forms of academic dishonesty are equally unacceptable. See additional information in SCampus and university policies on Research and Scholarship Misconduct. Students and Disability Accommodations: USC welcomes students with disabilities into all of the University’s educational programs. The Office of Student Accessibility Services (OSAS) is responsible for the determination of appropriate accommodations for students who encounter disability-related barriers. Once a student has completed the OSAS process (registration, initial appointment, and submitted doc- umentation) and accommodations are determined to be reasonable and appropriate, a Letter of Accommodation (LOA) will be available to generate for each course. The LOA must be given to each course instructor by the student and followed up with a discussion. This should be done as early in the semester as possible as accommodations are not retroactive. More information can be found at osas.usc.edu. You may contact OSAS at (213) 740-0776 or via email at osasfrontdesk@usc.edu. Support Systems: Counseling and Mental Health - (213) 740-9355 – 24/7 on call studenthealth.usc.edu/counseling Free and confidential mental health treatment for students, including short-term psychotherapy, group counseling, stress fitness workshops, and crisis intervention. National Suicide Prevention Lifeline - 1 (800) 273-8255 – 24/7 on call suicidepreventionlifeline.org Free and confidential emotional support to people in suicidal crisis or emotional distress 24 hours a day, 7 days a week. Relationship and Sexual Violence Prevention Services (RSVP) - (213) 740-9355(WELL), press “0” after hours – 24/7 on call studenthealth.usc.edu/sexual-assault Free and confidential therapy services, workshops, and training for situations related to gender- based harm. Office for Equity, Equal Opportunity, and Title IX (EEO-TIX) - (213) 740- eeotix.usc.edu Information about how to get help or help someone affected by harassment or discrimination, rights of protected classes, reporting options, and additional resources for students, faculty, staff, visitors, and applicants.