# Run inference at scale
Cortex is an open source platform for large-scale machine learning inference workloads.
## Workloads
### Realtime APIs - respond to prediction requests in real-time
- Deploy TensorFlow, PyTorch, and other models.
- Scale to handle production workloads with server-side batching and request-based autoscaling.
- Configure rolling updates and live model reloading to update APIs without downtime.
- Serve many models efficiently with multi-model caching.
- Perform A/B tests with configurable traffic splitting (see the sketch after this list).
- Stream performance metrics and structured logs to any monitoring tool.
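As an illustration of the traffic-splitting feature above, an A/B test is typically expressed as a separate configuration that routes a share of requests to each API. This is a minimal sketch assuming two already-deployed Realtime APIs named `text-generator-a` and `text-generator-b`; the `TrafficSplitter` kind and its fields should be checked against the documentation for your Cortex version.

```yaml
# traffic_splitter.yaml (illustrative sketch; verify field names against the docs)

- name: text-generator
  kind: TrafficSplitter
  apis:
    # assumed names of two previously deployed Realtime APIs
    - name: text-generator-a
      weight: 80
    - name: text-generator-b
      weight: 20
```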
### Batch APIs - run distributed inference on large datasets
- Deploy TensorFlow, PyTorch, and other models (a predictor sketch follows this list).
- Configure the number of workers and the compute resources for each worker.
- Recover from failures with automatic retries and dead letter queues.
- Stream performance metrics and structured logs to any monitoring tool.
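For comparison with the Realtime example below, here is a hedged sketch of what a Batch API predictor could look like. The `batch_id` argument and the `on_job_complete` hook are assumptions about the batch interface and should be verified against the Batch API documentation.

```python
# batch_predictor.py (illustrative sketch; method signatures are assumptions)
import json

from transformers import pipeline


class PythonPredictor:
    def __init__(self, config):
        # load the model once per worker
        self.model = pipeline(task="text-generation")
        self.results = []

    def predict(self, payload, batch_id):
        # payload is a batch (list) of items; process each one
        for item in payload:
            self.results.append(self.model(item["text"])[0])

    def on_job_complete(self):
        # aggregate results when the job finishes (e.g., write to object storage)
        print(json.dumps(self.results))
```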
## How it works
### Implement a Predictor
```python
# predictor.py

from transformers import pipeline


class PythonPredictor:
    def __init__(self, config):
        self.model = pipeline(task="text-generation")

    def predict(self, payload):
        return self.model(payload["text"])[0]
```
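Before deploying, the predictor class can be exercised as plain Python to confirm it behaves as expected (this quick local check is not part of Cortex's workflow):

```python
# local sanity check (not part of Cortex's deployment workflow)
from predictor import PythonPredictor

predictor = PythonPredictor(config={})
print(predictor.predict({"text": "hello world"}))
# e.g. {"generated_text": "hello world ..."}
```

The API itself is then defined in a configuration file: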
```yaml
# text_generator.yaml

- name: text-generator
  kind: RealtimeAPI
  predictor:
    type: python
    path: predictor.py
  compute:
    gpu: 1
    mem: 8Gi
  autoscaling:
    min_replicas: 1
    max_replicas: 10
```
### Deploy
```bash
$ cortex deploy text_generator.yaml

# creating http://example.com/text-generator
```
### Serve prediction requests
```bash
$ curl http://example.com/text-generator \
    -X POST -H "Content-Type: application/json" \
    -d '{"text": "hello world"}'
```
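The same request can also be made from Python; this sketch assumes the example endpoint above and uses the `requests` library:

```python
# query the deployed API from Python (endpoint URL is the example above)
import requests

response = requests.post(
    "http://example.com/text-generator",
    json={"text": "hello world"},
)
print(response.json())
```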
## Get started