DS 6030 | Spring 2026 | University of Virginia
Homework #7: Prediction Intervals and Base Rate Correction
Problem 1: Unbalanced Data
A researcher is trying to build a predictive model for distinguishing between real and AI generated images. She collected a random sample (\(n=10,000\)) of tweets/posts that included images. Expert analysts were hired to label the images as real or AI generated. They determined that 1000 were AI generated and 9000 were real.
She tasked her grad student with building a binary risk model (e.g., logistic regression, boosted trees) to predict the probability that a new image is AI generated. After reading on the internet, the grad student became concerned that the data was unbalanced and fit the model using a weighted log-loss \[ -\sum_{i=1}^n w_i \left[ y_i \log \hat{p}(x_i) + (1-y_i) \log (1-\hat{p}(x_i)) \right] \] where \(y_i = 1\) if AI generated (\(y_i=0\) if real) and \(w_i = 1\) if \(y_i = 1\) (AI) and \(w_i = 1/9\) if \(y_i = 0\) (real). This makes \(\sum_i w_iy_i = \sum_i w_i(1-y_i) = 1000\). That is the total weight of the AI images equals the total weight of the real images. Note: A similar alternative is to downsample the real images; that is, build a model with 1000 AI and a random sample of 1000 real images. The grad student fits the model using the weights and is able to make predictions \(\hat{p}(x)\).
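The grad student's weighted fit can be sketched as follows. This is a minimal simulation, not the actual homework data or code: the single feature, its distributions, and the logistic regression choice are all assumptions for illustration; only the class sizes and weights come from the problem.

```python
import numpy as np
from sklearn.linear_model import LogisticRegression

rng = np.random.default_rng(0)
n_ai, n_real = 1000, 9000

# Simulated stand-in feature: AI images score higher on average (assumption).
X = np.concatenate([rng.normal(1.0, 1.0, n_ai),
                    rng.normal(0.0, 1.0, n_real)]).reshape(-1, 1)
y = np.concatenate([np.ones(n_ai), np.zeros(n_real)])

# Weights from the problem: w=1 for AI, w=1/9 for real,
# so each class carries total weight 1000.
w = np.where(y == 1, 1.0, 1 / 9)

# sample_weight makes sklearn minimize exactly the weighted log-loss shown above.
model = LogisticRegression().fit(X, y, sample_weight=w)
p_hat = model.predict_proba(X)[:, 1]
```

Note that because the weighting balances the two classes, the fitted probabilities behave as if the base rate were 1/2 rather than 1/10.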
While the grad student was busy implementing this model, the researcher grabbed another 1000 random tweets/posts with images and had the experts again label them real or AI. Excitedly, the grad student makes predictions on the test data. However, the model doesn't seem to work well on these new test images: while the AUC appears good, the log-loss and Brier scores are really bad.
Hint: By using the weights (or undersampling), the grad student is modifying the base rate (prior class probability).
a. What is going on?
How can the AUC be strong while the log-loss and Brier scores aren't?
b. What is the remedy?
Specifically, how should the grad student mathematically adjust the predictions for the new test images? Use equations and show your work. Hints: the model is outputting \(\hat{p}(x) = \widehat{\Pr}(Y=1|X=x)\).
c. Base rate correction
If the grad student’s weighted model predicts an image is AI generated with \(\hat{p}(x) = .80\), what is the updated prediction under the assumption that the true proportion of AI images is 1/10?
d. Implement the correction
The grad student’s weighted model predictions on the test data are available at hw7_test.csv (with columns label and p_hat). Apply your correction from part (b) assuming the true proportion of AI images is 1/10.
- Compute the corrected predictions.
- Calculate the AUC, log-loss, and Brier score before and after correction.
- Comment on the results.
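All three metrics in part (d) are available in scikit-learn. A minimal sketch on placeholder labels and predictions (the labels and p_hat arrays below are made up; reading hw7_test.csv and applying your correction are left to you):

```python
import numpy as np
from sklearn.metrics import roc_auc_score, log_loss, brier_score_loss

# Placeholder data standing in for the label and p_hat columns of hw7_test.csv.
label = np.array([0, 0, 1, 0, 1, 0, 0, 0, 1, 0])
p_hat = np.array([.2, .4, .8, .3, .7, .1, .5, .2, .9, .3])

auc = roc_auc_score(label, p_hat)        # rank-based: unaffected by monotone recalibration
ll = log_loss(label, p_hat)              # proper scoring rule: sensitive to calibration
brier = brier_score_loss(label, p_hat)   # mean squared error of the probabilities
```

Computing these before and after the correction is the comparison part (d) asks for; the AUC should not change under a monotone correction, while the log-loss and Brier score should.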
e. Conclusions
Was the grad student right to be concerned about class imbalance? Why or why not?
Problem 2: More housing prices
This problem uses the realestate-train and realestate-test data (click on links for data) from Homework 3.
The goal of this contest is to predict the sale price in thousands (the price column), along with an 80% prediction interval, for each house in the test set.
a. Load the data
b. Median regression
Fit a quantile regression model for the conditional median of sale price.
Use your fitted model to generate a predicted median sale price for every observation in the test set.
c. Prediction Intervals
Fit the necessary quantile regression models to construct an 80% prediction interval for the sale price. Generate the 80% prediction intervals for each observation in the test set.
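One way to fit the required quantile models is sketched below, using gradient boosting with the quantile (pinball) loss at the 0.1, 0.5, and 0.9 quantiles; this is one option, not a mandated method, and the data here are simulated stand-ins for the real-estate features and prices.

```python
import numpy as np
from sklearn.ensemble import GradientBoostingRegressor

rng = np.random.default_rng(1)
X = rng.uniform(0, 10, (500, 1))                 # stand-in for the house features
y = 50 + 20 * X[:, 0] + rng.normal(0, 10, 500)   # stand-in for price (thousands)

# One model per quantile: 0.1 and 0.9 bracket an 80% interval, 0.5 is the median.
models = {q: GradientBoostingRegressor(loss="quantile", alpha=q,
                                       random_state=0).fit(X, y)
          for q in (0.1, 0.5, 0.9)}

X_test = rng.uniform(0, 10, (100, 1))
lower = models[0.1].predict(X_test)
median = models[0.5].predict(X_test)
upper = models[0.9].predict(X_test)
```

Separately fitted quantile models can occasionally cross (lower above upper for some rows); it is worth checking for and fixing this before submitting.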
d. Submission
Submit a .csv file named lastname_firstname.csv (comma separated, no extra spaces) containing your predictions. The file must include three columns named median, lower, and upper, with one prediction per row in the same order as the test data. Submissions will be evaluated using an automated grader. Files that do not follow the required format exactly may not be graded and will lose up to 1 point.
Your predictions will be evaluated using two criteria:
- mean absolute error (MAE) of the predicted median
- empirical coverage of the prediction intervals
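The two criteria can be sketched as follows; the helper names `mae` and `coverage` and the toy numbers are hypothetical, not the grader's actual code.

```python
import numpy as np

def mae(y_true, median_pred):
    # Mean absolute error of the predicted medians.
    return np.mean(np.abs(y_true - median_pred))

def coverage(y_true, lower, upper):
    # Fraction of true prices falling inside their prediction interval.
    return np.mean((lower <= y_true) & (y_true <= upper))

# Toy example: three houses, one interval misses its true price.
y = np.array([100., 200., 300.])
m = np.array([110., 190., 310.])
lo = np.array([90., 180., 250.])
hi = np.array([130., 220., 290.])

mae_val = mae(y, m)
cov_val = coverage(y, lo, hi)
```

For a well-calibrated 80% interval, the empirical coverage on the test set should be close to 0.80.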