Your final project in this course will be a collaborative that uses each element of the data science process to answer questions on a topic of your choosing. Your team will be responsible for finding and cleaning data; producing visualizations and exploratory analyses; producing concrete data-centric deliverables; and disseminating results. You are expect to organize your work and to collaborate using best practices.
You will work closely with other classmates in a team of 4-5 on this project, and are free to form teams of your choosing.
If you can’t find a team, or wish to form a team of a different size, please reach out to the instructor. In general, we do not anticipate that the grades for each group member will be different. We do, however, reserve the right to assign different grades to each group member based on peer assessments or public records of contribution (e.g. through commit histories).
|November 7 by 1:00 pm||Form a team and submit a proposal||Written proposal document|
|November 10-13||Project review meeting||Zoom meeting – no “deliverable”|
|December 5 by 4:00 pm||Report||Written report giving detailed project description|
|December 5 by 4:00 pm||Webpage and screencast||Webpage overview of project, with short explanatory video (published online)|
|December 5 by 8:00 pm||Peer assessment||Brief assessment of your teammates contributions (as a short document)|
|December 10||“In class” discussion of projects||Enjoy hearing about projects! (Also get hex stickers…)|
Other than peer assessments, each deliverable will appear online (as e.g. a GH repo or youtube video). Links should be submitted via Courseworks by one team member (not necessarily the same person for each deliverable).
First, you will define your teams and propose a project. This proposal should be a half-page to a page in length and include:
There should be one proposal per group, written collaboratively using .Rmd (rendering to a GH document) in a dedicated GitHub repo. Conceptually, this is intended to review a project that is 10% complete.
Based on the topic of your proposal, you will work a member of the teaching team; this person will be your primary resource and will guide you through the rest of the project. In particular, you will schedule a project review meeting with your teaching team leader to discuss the proposal, anticipated stumbling blocks, and preliminary work. All team members are required to be present for the meeting. Conceptually, the project review meeting is a 30% review.
The written report produced by your team is central to this project. This will detail how you completed your project, and should cover data collection and cleaning, exploratory analyses, alternative strategies, descriptions of approaches, and a discussion of results. We anticipate that your project will change somewhat over time; these changes and the reasons for them should be documented! You will write one report document per group, and be sure to include all group member names in the document.
Your report should include the following topics. Depending on your project type the amount of discussion you devote to each of them will vary:
As this will be your only chance to describe your project in detail, make sure that your report is a standalone document that fully describes your process and results. We also expect you to write high-quality code that is understandable to an outside reader. Coding collaboratively and actively reviewing code within the team will help with this!
You will create a webpage for your project. This should gives an overview of the project scope, data, approaches, visualizations, and other results, in a way that it accessible to a broad audience. Include the full report as a page in the website, so that readers can find a detailed explanation of your work.
You will also create a two-minute narrated screencast illustrating your project (screencasts are videos of your computer screen with spoken audio explaining what is shown on the screen – see the RStudio webinar page for some examples). You may use slides, demonstrations, or any other content that is relevant to your project. Publish your screencast on youtube, vimeo, or another online platform, and embed the screencast in your website. The two-minute limit will be strictly enforced.
For both the website and the screencast, your audience is classmates who worked on other projects. It will be helpful to put yourself in their shoes, and ask what information you think will be most interesting. We suggest you emphasize motivation, questions, and results over methods; after all, interested folks can view your complete project report on the same page.
Your report and website should be written collaboratively using GitHub. A link to this repo should be included prominently on your website’s landing page.
You will be asked to submit a link to your project website.
It is important to provide positive feedback to people who worked hard for the good of the team and to also make suggestions to those you perceived not to be working as effectively on team tasks. We ask you to provide an honest assessment of the contributions of the members of your team, including yourself. The feedback you provide should reflect your judgment of each team member:
Grading for the final project will roughly follow the rubric below:
As noted above, we anticipate that team members will recieve the same grade in most instances, but we may assign different grades to each group member based on peer assessments or public records of contribution (e.g. through commit histories).
The examples below are drawn from previous submissions in the to give an idea of the range of possible projects.