Workflow Element Store

  1. APIs and Data Feeds
  2. Data Collaboration and Partnerships
  3. Unstructured data (Audio)
  4. Data Pre-existing
  5. Structured Data (Tabular)
  6. Public Datasets
  7. Surveys and Questionnaires
  8. Data Logging
  9. Data Generation
  10. Crowdsourcing
  11. Unstructured data (Images / Videos)
  12. Mobile Applications or IoT Applications
  13. WebScraping
  1. AWS Redshift
  2. Azure Data Warehouse
  3. PostgreSQL
  4. Azure blob storage
  5. MS SQL server
  6. S3
  7. MySQL
  8. Oracle DB
  9. Informatica
  10. GCP BigQuery
  11. NoSQL DB
  12. RDBMS
  13. GCS
  1. Textual Feature Extraction
  2. Handling Categorical Data
  3. Binning
  4. Logarithmic Transform
  5. Handling Imbalanced Classes
  6. Dimensionality Reduction
  7. Handling Time-Series Data
  8. Domain-Specific Feature Engineering
  9. Encoding Categorical Variables
  10. Handling Missing Data
  11. Handling Noisy Data
  12. Data Scaling and Normalization
  13. Interaction Features
  14. Polynomial Features
  15. AutoEDA libraries
  16. Feature Selection
  17. Data Scaling and Normalization
  18. Time-Based Features
  19. Dimensionality Reduction
  20. Auto-Preprocessing libraries
  21. Dealing with Outliers
  22. Feature Extraction from Images
  1. Unsupervised Learning
  2. Data Partitioning
  3. Blackbox Techniques
  4. Supervised Learning-multiclass classification
  5. Forecasting
  6. Supervised Learning-binary classification
  7. Time Series Anaysis
  8. Ensemble Techniques
  9. Supervised Learning-Regression
  10. Train-Test Split
  1. Train-Test Split
  2. Learning Rate Scheduling
  3. Data Partition-sequential
  4. Data Augmentation
  5. Early Stopping
  6. Weight Initialization
  7. Gradient Clipping
  8. Batch Size Selection
  9. Hyperparameter Tuning
  10. Batch Normalization
  11. Transfer Learning
  12. Regular Monitoring and Logging
  13. Cross-Validation
  14. Ensemble Methods
  15. Regularization
  1. Model Comparison
  2. Performance Visualization
  3. Train-Test Split
  4. Model Interpretability
  5. Regularization Techniques
  6. Evaluation Metrics
  7. Data Partitioning
  8. Cross-Validation
  9. External Validation
  10. Hyperparameter Tuning
  1. Feedback Collection
  2. Data Drift Monitoring
  3. Serverless Computing
  4. Model Versioning
  5. Monitoring and Logging
  6. Edge Deployment
  7. Documentation and API Documentation
  8. Performance Metrics
  9. Model Monitoring and Maintenance
  10. Model Registry
  11. Model Health Monitoring
  12. Model Serialization
  13. Concept Drift Detection
  14. Documentation and Reporting
  15. Security Considerations
  16. Model Retraining and Updating
  17. Error Analysis
  18. Cloud Deployment
  19. Model Drift
  20. Streamlit
  21. A/B Testing
  22. Prediction Logging
  23. Web APIs - Flask, FastAPI, etc.
  24. Alerting and Notification
  25. Bias and Fairness Assessment
  26. Continuous Integration and Deployment (CI/CD)
  27. Containerization
  1. End User Machine
  2. Mobile
ML Workflow Beginner - Architecture
  • Element belongs to model
  • Element not belongs to model
Feature Store

Feature Store
(Online / Offline)

Data Sources

Data Sources

Data Warehouse

Data Warehouse/ Data Lake

Data Pre Processing & Feature Engineering

EDA, Data Pre Processing & Feature Engineering

Model Selection

Model Selection

Model Training & Hyper Parameter Tuning

Model Training & Hyper Parameter Tuning

Model Evaluation

Model Evaluation

Model Deployment

Model Deployment

End User Device

End User Device

Model Registry

Model Registry