All ML is AI, but not all AI is ML.
For future reference, this is an ever-growing collection of
resources and platforms for machine learning engineers, data analysts,
and DevOps
Discover and curate high-quality training data from trusted sources. This forms the foundation of any ML project, as the quality and quantity of your training data directly impacts model performance.
Tools and resources for managing your own, private, datasets.
`, link: `https://superannotate.com/` })">The Hugging Face Hub hosts a large number of community-curated datasets for a diverse range of tasks such as translation, automatic speech recognition, and image classification.
Alongside the information contained in the dataset card, many datasets include a Dataset Viewer to showcase the data.
`, link: `https://huggingface.co/datasets` })">Select pre-trained models that align with your project requirements. Whether building from scratch or leveraging existing models, choosing the right architecture is crucial.
Kaggle has recently introduced the ability for the community to publish models to Kaggle Models. There are a few ways to accomplish this including exclusively via the UI.
`, link: `https://www.kaggle.com/models` })">Models are stored in repositories, so they benefit from all the features possessed by every repo on the Hugging Face Hub.
Additionally, model repos have attributes that make exploring and using models as easy as possible.
`, link: `https://huggingface.co/models` })">Registry allows individuals and teams across the entire organization to share and collaboratively manage the lifecycle of all models, datasets and other artifacts.
This is meant for private use and W&B does not openly publish models for the wide audience. A minimum free plan is required.
`, link: `https://wandb.ai/` })">Optimise model performance by adjusting parameters for your specific use case. This step adapts pre-trained models to your unique requirements and data.
You can deploy Jupyter Lab from a template container and use popular Python libs to import and fine tune models.
`, link: `https://runpod.io?ref=4qeqqhe7` })">Spaces are built with Streamlit, a Python library for building web applications.
`, link: `https://huggingface.co/spaces` })">Deploy your models to scalable cloud infrastructure for reliable access and collaboration. This ensures your models are available when and where they're needed.
Requires AWS account to access.
Less transparent than Hugging Face.
`, link: `https://aws.amazon.com/sagemaker/?nc=sn&loc=4` })">Spaces are built with Streamlit, a Python library for building web applications.
`, link: `https://huggingface.co/spaces` })">Demonstrate your model's capabilities through interactive demos and documentation. This helps stakeholders understand and interact with your models.
Spaces are built with Streamlit, a Python library for building web applications.
`, link: `https://huggingface.co/spaces` })">Integrate models into production environments with robust CI/CD pipelines. This step brings your models from development to real-world applications.
Requires account
`, link: `https://azure.microsoft.com/en-us/products/machine-learning/?hl=en` })">Spaces are built with Streamlit, a Python library for building web applications.
`, link: `https://huggingface.co/spaces` })">Track model performance and maintain data quality in production. Continuous monitoring ensures your models remain accurate and reliable over time.
While subjective, the categories have been curated based on my experience and research of the ML/AI landscape. They might change in the future.
I'm open to suggestions for new categories or subcategories.
Most of the products have been encountered while working on ML projects for startups. Feel free to suggest new products, services or tools which fit in the categories.
Coming soon.