Introduction
TensorOpera Open Source library : https://github.com/FedML-AI/FedML/
TensorOpera®Launch: a super launcher and scheduler for running any AI jobs in Any GPU Clouds or On-prem Cluster:
fedml/computing/scheduler
-- TensorOpera®Launch: a unified launcher and scheduler for running any AI jobs across any GPU clouds and on-prem cluster.
TensorOpera®Deploy: model deployment:
fedml/serving
-- TensorOpera®Deploy: scalable model serving and deploymentfedml/serving/scalellm
- an efficient engine for scaling LLM in low-memory GPUs
TensorOpera®Train: model training and observability:
fedml/train
-- training framework, especially for LLM and generative AIsfedml/train/cross_cloud
-- cross-cloud distributed training, a spotlight project for it is "UnitedLLM"fedml/train/llm
-- LLM training and fine-tuning
fedml/core/mlops
-- training experimental tracking: logging and observability
TensorOpera®Federate: federated learning and analytics framework:
fedml/cross_device
-- cross-device federated learning (FL), tailored for the smartphone-based FLfedml/cross_silo
-- cross-silo federated learningfedml/simulation
-- simulator for federated learningfedml/fa
-- federated analytics
fedml/api
-- TensorOpera public Python APIs: fedml.api., including APIs for launch, training, deployment, and federated learning. Note: at this stage, it doesn't cover all TensorOpera APIs; some others are in fedml.core. and fedml.*.
fedml/cli
-- command line interface (CLIs) implementation for TensorOpera library
fedml/core
-- core modules shared by all training, deployment, and federated learning frameworks, including features related to communication backend, security/privacy, design pattern, logging, etc.
fedml/mlops
-- MLOps (TensorOpera AI) related features
fedml/device
-- GPU placement and management
fedml/data
-- prebuilt data loaders
fedml/model
-- prebuilt models
fedml/ml
-- integrating with many other general machine learning frameworks
fedml/utils
-- common utilities
TensorOpera Spotlight Projects:
FedML/python/spotlight_prj/UnitedLLM
-- a cross-cloud distributed training framework for large-scale language models (LLMs)FedML/python/spotlight_prj/FedLLM
-- a federated learning framework for LLMs
and more...
TensorOpera Examples (Prebuilt Jobs in Jobs Store):
FedML/python/examples
-- examples for training, deployment, and federated learningFedML/python/examples/launch
-- examples for TensorOpera®LaunchFedML/python/examples/serving
-- examples for TensorOpera®DeployFedML/python/examples/train
-- examples for TensorOpera®TrainFedML/python/examples/cross_cloud
-- examples for TensorOpera®Train cross-cloud distributed trainingFedML/python/examples/federate/prebuilt_jobs
-- examples for federated learning prebuilt jobs (FedCV, FedNLP, FedGraphNN, Healthcare, etc.)FedML/python/examples/federate/cross_silo
-- examples for cross-silo federated learningFedML/python/examples/federate/cross_device
-- examples for cross-device federated learningFedML/python/examples/federate/simulation
-- examples for federated learning simulationFedML/python/examples/federate/security
-- examples for TensorOpera®Federate security related featuresFedML/python/examples/federate/privacy
-- examples for TensorOpera®Federate privacy related featuresFedML/python/examples/federate/federated_analytics
-- examples for TensorOpera®Federate federated analytics (FA)