Architecture
Last updated
Was this helpful?
Last updated
Was this helpful?
What technologies will you use? How will the pieces fit together? That's systems architecture. Some of the tech choices have already been made; however, you will still have almost full decision-making power in the app.
The following is a simplistic view of one of the major architectural designs that apps follow.
Here’s a good explanation of 3 different architecture patterns for how to serve machine learning models.
Pattern #1: Batch predictions. You probably haven’t done this at BloomTech, but it can be a good, simple option for some forecasting problems. You make predictions in batches, ahead of time, and store them in a database.
Pattern #2: Predictions on demand, embedded into one app, “tightly coupled.” BloomTech examples include the Twitoff app in Unit 3, and Plotly Dash apps in Build Weeks.
Pattern #3: Predictions on demand, via a separate “microservice” API, “loosely coupled” with your main app.
See also Emerging Architectures for Modern Data Infrastructure by Andreessen Horowitz.
Your product will likely have user authentication, but the design & implementation will be your Web teammates responsibility, not yours. DS doesn’t need to directly do anything with Okta.
If your DS API shouldn’t be public to the internet (for example, if you’re serving private data) then DS can configure CORS (cross-origin resource sharing) to only permit calls from your Web Backend. Refer to these great docs: https://fastapi.tiangolo.com/tutorial/cors/
FastAPI has other security options if you're curious. See https://fastapi.tiangolo.com/tutorial/security/
Decide what inputs you will need. What options will the user have, to filter, sort, or customize the visualization? These inputs will usually come through a GET request, as path parameters or query parameters.
Decide what output you will provide. We need to send our Web colleagues the format they expect: JSON. Many visualization libraries are able to do this, as well as FastAPI.
First, decide what model/algorithm you will utilize. Do you need a machine learning model or just an algorithm?
Depending on the problem, you may be able to create an algorithm. No need for a mega-ton hammer, if a small mallet will do.
What questions do you want to answer? This depends on your product and users.
What questions can you answer? This depends on what data you have, and whether it's labeled.
Follow this flowchart:
Pre-trained models are useful. If you have labeled data, you can fine-tune most pre-trained models for your problem, for the best of both worlds.
ML Ops is the glue that binds the DS portion of the app together. There are few concepts that are paramount in the world of an ML Ops Engineer and one of them is component based design.
Let's say we need to create an API for the DS side of an app. We don't want to create a function in the API route as it makes the API bloated.
It's better practice to create a separate file to house your functions. In the following example, we have moved the create()
function to the data
file in the main app
directory and import it into our API