Versioning Three Decades of Models
The project is built around a blog post by Andrej Karpathy, in which he recreates Yann Lecun’s 1989 CNN on MNIST, which was a very early breakthrough convolutional neural network. Then with the benefits of 33 years of machine learning knowledge, the Karpathy shows how he can add different features to the model and different data augmentations to improve performance
We can show how MLOps can be used to showcase these changes. We can version models to allow retrieval of models from various eras with different features added/removed, we can do data versioning to show how to manage the processing of handling multiple datasets while maintaining train/val/test splits and we can compare model performances with an experiment tracker.
We can also showcase deployment, demonstrating how you can set up a simple application to which users can submit queries (in the form of images) to any/all models that we’ve trained
As a stretch goal, we can look at if there are any advances in the last three years that can improve the score, or if there are any techniques that would improve the score
Tangible Outputs: The tangible outputs of the project are