Datasets, generalization, and overfitting: Test Your Knowledge

1. Which of the following is an example of a stationary dataset?

- Rainfall rates in Seattle, Washington
- Catalog of musical notes in Beethoven’s symphonies
- Most frequently used words in spam emails
- Daily ticket sales at a movie theater

2. You are preparing to train a model for predicting the sale price of used cars using a dataset that contains the following features: year, model, and mileage. When you examine the dataset, you discover that mileage values are missing for 150 out of 2,500 examples. Which of the following options would be reasonable actions to take? (Choose all that apply)

- Removing those 150 examples from the dataset
- Removing the mileage column from the dataset, and training the model on just year and model
- Inserting a value of 0 into all the empty mileage fields
- Inferring a mileage estimate for each empty field by multiplying the age of the car by the average yearly mileage for all cars in the dataset

3. You are training a streaming service's movie-recommendation model to predict whether or not a user will enjoy a given movie. Which of the following would be reasonable proxy labels for "User enjoyed the movie"? (Choose all that apply)

- User saved the movie to their “Want to Watch” list.
- User clicked "Start watching movie."
- User recommended the movie to another user.
- User gave the movie a 5-star rating.

4. True or False: Training your model until it achieves a low loss value on your test data is a good way to prevent overfitting.

- True
- False

5. Fill in the blank in the following sentence: Regularization improves your model's ability to generalize to new data by penalizing ___ during training.

- Incorrect predictions
- Learning rate
- Complexity
- Gradient descent
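
To make the arithmetic in the last option of the used-car question concrete, here is a minimal Python sketch of that estimate: the car's age multiplied by an average yearly mileage. Several details are assumptions not found in the quiz: the record layout (dicts with `year`, `model`, `mileage` keys), the `current_year` parameter used to derive a car's age, the function name `impute_mileage`, and the choice to compute the yearly average only from cars whose mileage is known (total miles divided by total years). The sample values are made up for illustration.

```python
def impute_mileage(cars: list[dict], current_year: int) -> list[dict]:
    """Return copies of the car records with missing mileage estimated.

    Estimate = car age * average yearly mileage, where the average is
    pooled over cars with known mileage (an assumption of this sketch).
    """
    # Cars that can contribute to the average: known mileage, age > 0.
    known = [
        c for c in cars
        if c["mileage"] is not None and current_year > c["year"]
    ]
    if not known:
        raise ValueError("no cars with known mileage to estimate from")

    total_miles = sum(c["mileage"] for c in known)
    total_years = sum(current_year - c["year"] for c in known)
    avg_miles_per_year = total_miles / total_years

    filled = []
    for c in cars:
        row = dict(c)  # copy so the input records are left untouched
        if row["mileage"] is None:
            age = current_year - row["year"]
            row["mileage"] = age * avg_miles_per_year
        filled.append(row)
    return filled


# Hypothetical sample data, not taken from the quiz.
cars = [
    {"year": 2015, "model": "A", "mileage": 50000},  # 5 years old
    {"year": 2018, "model": "B", "mileage": 20000},  # 2 years old
    {"year": 2010, "model": "C", "mileage": None},   # 10 years old
]
filled = impute_mileage(cars, current_year=2020)
print(filled[2]["mileage"])  # 70000 miles / 7 years * 10 years
```

In the sample, the two cars with known mileage cover 70,000 miles over 7 car-years, so the pooled average is 10,000 miles per year and the 10-year-old car is assigned an estimate of 100,000 miles. A per-car average (the mean of each car's miles per year) would be another defensible reading of "average yearly mileage" and can give a different estimate.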