Notebook
ASR with Whisper
Explore the capabilities of OpenAI's Whisper for automatic speech recognition by creating your own voice recordings!
Tensorflow 2.0 notebook to explain and visualize a HuggingFace BERT for Question Answering model.
Notebooks
NLP for Question Answering
Ongoing posts and code documenting the process of building a question answering model.
Explore how to use LIME and SHAP for interpretability.
Prototype
Refractor
Refractor predicts churn probabilities for telecom customers and shows which customer attributes contribute to those predictions.
Prototype
Anomagram
An interactive visualization tool for exploring how a deep learning model can be applied to the task of anomaly detection.
Prototype
Blip
Blip visualizes how four different anomaly detection algorithms perform at detecting network attacks.
Demo
S-quote
A semantic search engine that takes some input text and returns relevant famous quotes.
Prototype
Textflix
Textflix uses movie reviews to show how machine learning can unlock the data embedded in large amounts of unstructured text.
Prototype
ConvNet Playground
With ConvNet Playground you can explore how a convolutional neural network does semantic image search.
Notebook
Weak supervision with Snorkel
A notebook showing how to train a complaint classifier with Snorkel. Using data from the Consumer Financial Protection Bureau.
Prototype
Active Learner
An interactive visualization of active learning data labeling strategies for supervised machine learning.
Library
Handtrack.js
Handtrack.js is a library for prototyping realtime hand detection (bounding box), directly in the browser.
A toy example about logistic regression and different active learning strategies.
Prototype
Turbofan Tycoon
See if you have what it takes to make it as a turbofan factory owner in our federated learning prototype.
Prototype
Probabilistic Real Estate
A probabilistic programming prototype that predicts future real estate prices across New York City boroughs and neighborhoods.
Prototype
Brief Preview
Brief uses neural networks to score and highlight the most interesting sentences within any article.
An interactive notebook about using three.js to render tens of thousands of points.
Prototype
Encartopedia
Encartopedia visualizes Wikipedia topic clusters and plots your journey through them.
An interactive visualization that uses T-SNE to cluster movies together based on user ratings.
Prototype
Luhn Method Demo
Luhn's method, from 1958, provides a foundation for understanding modern auto-summarization techniques.
Cloudera Fast Forward Labs
Making the recently possible useful.
Cloudera Fast Forward Labs is an applied machine learning research group. Our mission is to empower enterprise data science practitioners to apply emergent academic research to production machine learning use cases in practical and socially responsible ways, while also driving innovation through the Cloudera ecosystem. Our team brings thoughtful, creative, and diverse perspectives to deeply researched work. In this way, we strive to help organizations make the most of their ML investment as well as educate and inspire the broader machine learning and data science community.
Cloudera Blog Twitter©2022 Cloudera, Inc. All rights reserved.