Production ML Systems: Test Your Knowledge

  1. You are using machine learning to build a classification model that predicts unicorn appearances. Your dataset details 10,000 unicorn appearances and 10,000 unicorn non-appearances. The dataset contains the location, time of day, elevation, temperature, humidity, tree cover, presence of a rainbow, and several other features.

    After launching your unicorn appearance predictor, you will need to keep your model fresh by retraining on new data. Because you are gathering more new data than you can train on, you decide to limit the training data by sampling the new data over a window of time. You also need to account for daily and annual patterns in unicorn appearances. What window of time do you choose?

  2. You launch your unicorn appearance predictor. It's working well! You go on vacation and return after three weeks to find that your model quality has dropped significantly. Assume that unicorn behavior is unlikely to change significantly in three weeks. What is the most likely explanation for the decrease in quality?

  3. You review the model's predictions for Antarctica, and discover the model has been making poor predictions there since the model was released into production. Which of the following could be the source of the problem?

  4. Your unicorn appearance predictor has operated for a year. You've fixed many problems, and quality is now high. However, you notice a small but persistent problem. Your model quality has drifted slightly lower in urban areas. What might be the cause?

  5. Through all your troubleshooting, you've greatly improved the quality of the unicorn model's predictions, and as a result, usage has increased tenfold. However, users are now complaining that the model is extremely slow; inference requests typically take more than 30 seconds to return predictions. Which of the following changes could help solve this problem?
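
As a point of reference for question 1, sampling new data over a window of time might be sketched as follows. This is a minimal illustration, not a prescribed implementation: the function name `sample_training_window`, the dictionary layout of each example, and the seven-day window in the usage lines are all hypothetical. The window length is left as a parameter because choosing it is the subject of the question.

```python
import random
from datetime import datetime, timedelta


def sample_training_window(examples, now, window, max_examples, seed=0):
    """Keep examples whose timestamp falls in [now - window, now],
    then downsample uniformly at random to at most max_examples.

    `examples` is assumed to be a list of dicts with a "timestamp" key
    (a hypothetical layout chosen for this sketch).
    """
    cutoff = now - window
    recent = [ex for ex in examples if cutoff <= ex["timestamp"] <= now]
    if len(recent) <= max_examples:
        return recent
    # A fixed seed makes the sample reproducible between retraining runs.
    return random.Random(seed).sample(recent, max_examples)


# Hypothetical log: one example per hour over the past 60 days.
now = datetime(2024, 6, 1)
log = [
    {"timestamp": now - timedelta(hours=h), "label": h % 2}
    for h in range(60 * 24)
]

# The 7-day window is an arbitrary placeholder, not a recommended value.
batch = sample_training_window(log, now, timedelta(days=7), max_examples=100)
```

Whatever window is chosen, uniform sampling inside it keeps each part of the window represented in proportion to how much data it produced, so a window that spans a full cycle of a pattern keeps that whole cycle in the training set.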