COPML - A Quick Introduction
Note: This has been adapted from a document written by myself, Jeff Fletcher and Ade Adewumni and was originally published on the Cloudera website.
More organisations are adopting machine learning (ML) and the market has matured enough to be able to offer more options and capabilities. However, in many cases the business value derived from their investment in ML has been inconsistent. This isn’t because enterprise customers have been careless in their approach to machine learning. Many were careful to conduct "science experiments" to identify which of their business challenges machine learning could be successfully applied to. The problem is that success in developing these proofs of concept (PoCs) does not necessarily translate into the successful "operationalisation" or deployment of machine learning. Further, this type of experimentation is not cheap. Outside of the so-called Big Tech firms1 few organisations can sustain extensive experimentation without clear evidence that it will contribute to the bottom line. If none or very few of the successful PoCs make it to deployment, it’s difficult for organisations to justify ongoing investment. This is true even if it’s been demonstrated, through PoCs, that machine learning can be successfully applied to the underlying business challenge.
Enterprise customers are realising that in order to see a return on their investment in machine learning, these successful ML experiments need a formalised path to deployment and a structured approach to post-deployment maintenance. They are also aware of the importance of doing this in a way that minimises costs and friction which make it harder to sustain the value of these ML applications in the medium to long term. This is where the methodology, Continuous Operations for Production Machine Learning (COPML) can help. COPML is a comprehensive approach to machine learning model development and operation that facilitates the creation of machine learning applications which deliver continuous business value for as long as possible and can be smoothly retired once this ceases to be the case.
The scope of COPML isn’t limited to model artefacts or the tools and systems that support their ongoing maintenance and ensure that they are smoothly served to downstream applications. It also supports the fostering of collaboration between the various teams involved in the machine learning workflow. This document provides an exploration of the COPML methodology, using two example use cases to further illustrate what its application could look like in practice.
-
Big Tech is used to collectively describe the most popular and best-performing American technology companies e.g. Facebook, Amazon, Microsoft, Apple, Netflix and Alphabet etc._ ↩