Blog

Mar 30, 2016 · interview

Shivon Zilis on the Machine Intelligence Landscape

2016 is shaping up to be a big year for machine intelligence. Achievements like DeepMind’s AlphaGo are making headlines in the popular press, large tech companies have started a “platform war” to become the go-to company for A.I., and entrepreneurs are increasingly building machine learning products that have the potential to transform how companies operate.

Exciting though the hype may be, the commercial potential of machine intelligence won’t be realized unless entrepreneurs and data scientists can clearly communicate the business value of new tools to non-technical executives. And the first step to communicating clearly is to define a vocabulary to think through what machine intelligence is, how the different algorithms work, and, most importantly, what practical benefits they can provide across verticals and industries.

Shivon Zilis, a partner and founding member of Bloomberg Beta, is doing just that. She has spent the past few years focused exclusively on machine intelligence, building out a vocabulary and taxonomy to help the community understand activity in the field and communicate new developments clearly and effectively.

We interviewed Zilis to learn her views on the past, present, and future of machine intelligence. Keep reading for highlights!

Let’s start with your background. What drew you to machine intelligence?

I started off as a weird kid interested in tech but lacking local influences to get into engineering at an early age. I was also a sports nerd, and, being from Canada, my first real ambition was to create Moneyball for hockey. Unfortunately, the hockey leagues weren’t capturing data like the baseball leagues, and I definitely didn’t have the resources to pay an army to capture the data required for that kind of task. So after studying economics and philosophy at Yale, I ended up working at IBM. This was at the time when the CFO was transitioning from being a bean counter to truly setting strategic direction for organizations, so I helped with a study about that shift. After, I joined a swat team trying to figure out how IBM, as a tech company, could support the world of microfinance. The project deepened my interest in the potential of data-driven businesses, which became my focus when I moved to Bloomberg. A few years back, my former colleague Matt Turck and I realized we lacked a good framework to think about data companies – not data storing and processing companies, but those that use algorithms intelligently to derive insights and value from data. That framework became the machine intelligence landscape.

Has your degree in economics and philosophy influenced how you think about the tech industry?

You know, the landscape is a decent technical analogy for how my mind works. The combination of liberal arts and economics provided me the mental flexibility that’s needed to provide value as a non-technical person working in the tech. As an investor, I navigate back and forth between understanding the tech behind strong products and understanding the abstract market trends that govern a sector’s development. Philosophy taught me how to see how general trends unfold from first principles, and economics are the fundamentals of the business world. Most of the founders I work with are highly technical, and I have deep respect for their skills; they are the truly special ones, not me! I’ll never drill as deep as they do, but, being truly inspired by their work, we work together to build a common language where they communicate the tech and I help them understand why it matters for the rest of the world.

How do you define machine intelligence?

I consider machine intelligence to be the entire world of learning algorithms, the class of algorithms that provide more intelligence to a system as more data is added to the system. The terms “machine learning” and “artificial intelligence” don’t overlap precisely in our common lexicon, so we needed a new term. But basically, these are algorithms that create products that seem human and smart.

Besides giving a snapshot of current companies in the space, what value does the landscape provide?

One thing I noticed early on is that machine intelligence startups had a marketing problem. Pragmatic business people don’t buy magical algorithms that do everything, which was how early startups marketed their work. The landscape, in turn, provides founders with language to explain what they do to a customer or investor, and helps them ask questions like “what business problem are we solving? Are we helping with recruiting? Research? Who is going to use our tool, data scientists or marketers? etc.” Data leaders in large enterprises are also using the chart to help internal teams think about this domain and how it may shape their future tech strategy. As regards investment strategies, I’d say there are three primary benefits. First, it’s like a ledger to keep track of a rapidly changing ecosystem. Second, it helps illuminate true outliers that could become huge investment opportunities, those companies that don’t neatly fit into the established buckets. Third, and perhaps most importantly, it helps track evolution of the landscape over time, as each chart captures a state in time. We’ve noticed huge changes between the first and second versions of the chart.

The 2014 Machine Intelligence Landscape

What changed over the past two years?

In 2014, many of the core technologies were sold as APIs. Today, there is a proliferation of companies that offer products and services built using the algorithms in the original APIs, but we’ve yet to see a parallel level of market adoption. This is a concerning trend. Alchemy API (now part of IBM) is arguably one of the most successful natural language processing startups and they struggled to break past a few million in revenue. We’re also seeing more machine learning platforms companies, but they struggle to explain to enterprises how they differ from competitors. I think the reason for this, which my partner James Cham thoughtfully pointed out, is that using machine intelligence isn’t as simple as just plugging in a technology; it requires an organizational shift in the way teams build and iterate on products, which enterprises still need help figuring out. As the market matures we will see more adoption. For a long time data science technologies had trouble selling in to the enterprise (it was too early), but the rise of data science tools reflects an underlying shift in the enterprise. Companies are starting to figure out how to productively use armies of data scientists; once teams start to expand rapidly, they open a market for tools that help with collaboration and version control tools (there was no need when it was an army of one). Finally, the most significant change occurred in the top row of the 2016 chart with the rise of the intelligent agent. What used to be packaged as a learning algorithm embedded in software to create static data output is now coming to market in form of a bot. These tools can accept multi-step instructions and negotiate on your behalf. The intelligent agent didn’t even exist as a category two years ago.

Writing about “magic wands,” SaaS tools that extract insights from data and seamlessly integrate those insights into a workflow, you mention the UI for these products should be created to “fortify the user’s knowledge rather than replace it.” Is market adoption slow because people are concerned about intelligent tools dulling their skills or automating their jobs?

That technology dulls certain skills and creates a need for new skills is not new to machine intelligence. Prior to the agricultural revolution, we could tell the difference between plant species and predict rain from the sound of a stream. Plato worried that writing, an early technology, would dull our memory. Tech companies have historically purported to save costs through automation. But I haven’t seen the same type of language in machine intelligence marketing, which tends to promise that it will help knowledge workers do their job better rather than replace them. Consider the HR job description tool Textio. People want to use this because it gives them super powers, funneling collective intelligence into their individual work. It helps to put this into perspective alongside a tool like spell check in email. Of course we want to have a machine improve our spelling! In the future, I expect people will come to use a text optimizer as naturally as they use spell check.

What virtual agents? Daniel Tunkelang wrote an interesting Medium post recently describing how we “trust computers to do things for us, but not as us.” Will the market require a cognitive shift to adopt virtual agents?

It depends on the product category. We have a summarization product in our portfolio (like the prototype you built at Fast Forward Labs) that tells you what you need to know about longer articles you have to read. Assistants like this have a low barrier to adoption. It’s the same with Textio, where users literally just pop in text they’ve written and get analytic feedback. Assistants that engage with others on your behalf have to deal nuances you may or may not even know exist. Using tools like a scheduling assistant can require upfront energy and investment, just like a human relationship! Sure, you can get married, share resources, and basically live separate lives, or you can invest the time required to build closeness and appreciate the nuances of another person.

What do you predict will happen to the machine intelligence market over the next year?

First, I think companies will make progress building great data science teams, and integrate data science more tightly into other business units. I think we’ll see a lot of investment in machine intelligence companies focused on healthcare, and a bunch of logistic companies that can support self-driving cars developed by larger players. I personally would love to see more applications in education, as there could be so much upside for society at large. It’s hard to sell into the education vertical, and really requires a brave individual to take on the task. We’re also really excited about companies like Gridspace that are capturing and processing audio data. There is so much data lost from live meetings or calls that could provide incredible insights. Finally, I’m slightly concerned all the bots being built could rapidly lead to overload and disenchantment. As with the mobile app explosion, the barrier to building a semi-functional bot is not very high, and that could lead to a lot of annoying bots pinging us needlessly. The risk lies in the gap between the hyperbolic expectations generated by the press and the rather banal reality of most products’ performance.

Fun times with the Fast Forward Labs slackbot.

What advice would you give to aspiring entrepreneurs?

A founder needs to be obsessed with a given problem to be successful. She needs to believe so deeply in the mission that she is resilient through criticisms, not winning a funding round, or not immediately winning customers. There are so many businesses out there, but those that are well backed are the outliers that are ready to devote 10 years of their life to one problem, ready to wake up every morning and passionately embrace their work. Today, there are many lucrative options for people with machine learning talents, as large companies are building their teams. We single out those few individuals who show us they’re willing to remake the world to solve their problem.

- Kathryn

Newer

Apr 6, 2016 · guest post

Where Do You Put Your Data Scientists?

Older

Mar 28, 2016 · announcement

Fast Forward Labs Data Leadership Conference

Latest posts

Nov 15, 2022 · newsletter

CFFL November Newsletter

November 2022 Perhaps November conjures thoughts of holiday feasts and festivities, but for us, it’s the perfect time to chew the fat about machine learning! Make room on your plate for a peek behind the scenes into our current research on harnessing synthetic image generation to improve classification tasks. And, as usual, we reflect on our favorite reads of the month. New Research! In the first half of this year, we focused on natural language processing with our Text Style Transfer blog series.

Nov 14, 2022 · post

Implementing CycleGAN

by Michael Gallaspy · Introduction This post documents the first part of a research effort to quantify the impact of synthetic data augmentation in training a deep learning model for detecting manufacturing defects on steel surfaces. We chose to generate synthetic data using CycleGAN,1 an architecture involving several networks that jointly learn a mapping between two image domains from unpaired examples (I’ll elaborate below). Research from recent years has demonstrated improvement on tasks like defect detection2 and image segmentation3 by augmenting real image data sets with synthetic data, since deep learning algorithms require massive amounts of data, and data collection can easily become a bottleneck.

Oct 20, 2022 · newsletter

CFFL October Newsletter

October 2022 We’ve got another action-packed newsletter for October! Highlights this month include the re-release of a classic CFFL research report, an example-heavy tutorial on Dask for distributed ML, and our picks for the best reads of the month. Open Data Science Conference Cloudera Fast Forward Labs will be at ODSC West near San Fransisco on November 1st-3rd, 2022! If you’ll be in the Bay Area, don’t miss Andrew and Melanie who will be presenting our recent research on Neutralizing Subjectivity Bias with HuggingFace Transformers.

Sep 21, 2022 · newsletter

CFFL September Newsletter

September 2022 Welcome to the September edition of the Cloudera Fast Forward Labs newsletter. This month we’re talking about ethics and we have all kinds of goodies to share including the final installment of our Text Style Transfer series and a couple of offerings from our newest research engineer. Throw in some choice must-reads and an ASR demo, and you’ve got yourself an action-packed newsletter! New Research! Ethical Considerations When Designing an NLG System In the final post of our blog series on Text Style Transfer, we discuss some ethical considerations when working with natural language generation systems, and describe the design of our prototype application: Exploring Intelligent Writing Assistance.

Sep 8, 2022 · post

Thought experiment: Human-centric machine learning for comic book creation

by Michael Gallaspy · This post has a companion piece: Ethics Sheet for AI-assisted Comic Book Art Generation I want to make a comic book. Actually, I want to make tools for making comic books. See, the problem is, I can’t draw too good. I mean, I’m working on it. Check out these self portraits drawn 6 months apart: Left: “Sad Face”. February 2022. Right: “Eyyyy”. August 2022. But I have a long way to go until my illustrations would be considered professional quality, notwithstanding the time it would take me to develop the many other skills needed for making comic books.

Aug 18, 2022 · newsletter

CFFL August Newsletter

August 2022 Welcome to the August edition of the Cloudera Fast Forward Labs newsletter. This month we’re thrilled to introduce a new member of the FFL team, share TWO new applied machine learning prototypes we’ve built, and, as always, offer up some intriguing reads. New Research Engineer! If you’re a regular reader of our newsletter, you likely noticed that we’ve been searching for new research engineers to join the Cloudera Fast Forward Labs team.

Reports

In-depth guides to specific machine learning capabilities

FF24

Text Style Transfer

The NLP task of text style transfer (TST) aims to automatically control the style attributes of a piece of text while preserving the content, which is an important consideration for making NLP more user-centric. In this report, we explore text style transfer through an applied use case — neutralizing subjectivity bias in free text. Along the way, we describe our sequence-to-sequence modeling approach leveraging HuggingFace Transformers, and present a set of custom, reference-free evaluation metrics for quantifying model performance. Finally, we conclude with a discussion of ethics centered around our prototype: Exploring Intelligent Writing Assistance.

Read the report →

FF22

Inferring Concept Drift Without Labeled Data

Concept drift occurs when the statistical properties of a target domain change overtime causing model performance to degrade. Drift detection is generally achieved by monitoring a performance metric of interest and triggering a retraining pipeline when that metric falls below some designated threshold. However, this approach assumes ample labeled data is available at prediction time - an unrealistic constraint for many production systems. In this report, we explore various approaches for dealing with concept drift when labeled data is not readily accessible.

Read the report →

FF19

Session-based Recommender Systems

Being able to recommend an item of interest to a user (based on their past preferences) is a highly relevant problem in practice. A key trend over the past few years has been session-based recommendation algorithms that provide recommendations solely based on a user’s interactions in an ongoing session, and which do not require the existence of user profiles or their entire historical preferences. This report explores a simple, yet powerful, NLP-based approach (word2vec) to recommend a next item to a user. While NLP-based approaches are generally employed for linguistic tasks, here we exploit them to learn the structure induced by a user’s behavior or an item’s nature.

Read the report →

FF18

Few-Shot Text Classification

Text classification can be used for sentiment analysis, topic assignment, document identification, article recommendation, and more. While dozens of techniques now exist for this fundamental task, many of them require massive amounts of labeled data in order to be useful. Collecting annotations for your use case is typically one of the most costly parts of any machine learning application. In this report, we explore how latent text embeddings can be used with few (or even zero) training examples and provide insights into best practices for implementing this method.

Read the report →

Prototypes

Machine learning prototypes and interactive notebooks

Notebook

ASR with Whisper

Explore the capabilities of OpenAI's Whisper for automatic speech recognition by creating your own voice recordings!

https://colab.research.google.com/github/fastforwardlabs/whisper-openai/blob/master/WhisperDemo.ipynb

Library

NeuralQA

A usable library for question answering on large datasets.

https://neuralqa.fastforwardlabs.com

Notebook

Explain BERT for Question Answering Models

Tensorflow 2.0 notebook to explain and visualize a HuggingFace BERT for Question Answering model.

https://colab.research.google.com/drive/1tTiOgJ7xvy3sjfiFC9OozbjAX1ho8WN9?usp=sharing

Notebooks

NLP for Question Answering

Ongoing posts and code documenting the process of building a question answering model.

https://qa.fastforwardlabs.com

Cloudera Fast Forward Labs

Making the recently possible useful.

Cloudera Fast Forward Labs is an applied machine learning research group. Our mission is to empower enterprise data science practitioners to apply emergent academic research to production machine learning use cases in practical and socially responsible ways, while also driving innovation through the Cloudera ecosystem. Our team brings thoughtful, creative, and diverse perspectives to deeply researched work. In this way, we strive to help organizations make the most of their ML investment as well as educate and inspire the broader machine learning and data science community.

Cloudera Blog Twitter

Mar 30, 2016 · interview

Shivon Zilis on the Machine Intelligence Landscape

The 2014 Machine Intelligence Landscape

Fun times with the Fast Forward Labs slackbot.

Read more

Apr 6, 2016 · guest post

Mar 28, 2016 · announcement

Latest posts

Nov 15, 2022 · newsletter

CFFL November Newsletter

Nov 14, 2022 · post

Implementing CycleGAN

Oct 20, 2022 · newsletter

CFFL October Newsletter

Sep 21, 2022 · newsletter

CFFL September Newsletter

Sep 8, 2022 · post

Thought experiment: Human-centric machine learning for comic book creation

Aug 18, 2022 · newsletter

CFFL August Newsletter

Popular posts

Oct 30, 2019 · newsletter

Nov 14, 2018 · post

Apr 10, 2018 · post

Oct 4, 2017 · post

Aug 22, 2016 · whitepaper

Feb 24, 2016 · post

Reports

FF24

FF22

FF19

FF18

Prototypes

Notebook

Library

Notebook

Notebooks

Cloudera Fast Forward Labs