Workflow Element Store

  1. Unstructured data (Audio)
  2. Data Pre-existing
  3. Structured Data (Tabular)
  4. Data Collaboration and Partnerships
  5. Public Datasets
  6. Mobile Applications or IoT Applications
  7. APIs and Data Feeds
  8. Surveys and Questionnaires
  9. Data Logging
  10. Data Generation
  11. Unstructured data (Images / Videos)
  12. WebScraping
  13. Crowdsourcing
  1. GCP BigQuery
  2. MySQL
  3. PostgreSQL
  4. S3
  5. AWS Redshift
  6. Azure blob storage
  7. Oracle DB
  8. MS SQL server
  9. NoSQL DB
  10. RDBMS
  11. Azure Data Warehouse
  12. GCS
  13. Informatica
  1. Polynomial Features
  2. Handling Time-Series Data
  3. Feature Selection
  4. Handling Missing Data
  5. Data Scaling and Normalization
  6. Logarithmic Transform
  7. Interaction Features
  8. Dealing with Outliers
  9. Auto-Preprocessing libraries
  10. Binning
  11. Dimensionality Reduction
  12. Encoding Categorical Variables
  13. Textual Feature Extraction
  14. Feature Extraction from Images
  15. AutoEDA libraries
  16. Dimensionality Reduction
  17. Handling Imbalanced Classes
  18. Handling Categorical Data
  19. Data Scaling and Normalization
  20. Handling Noisy Data
  21. Time-Based Features
  22. Domain-Specific Feature Engineering
  1. Ensemble Techniques
  2. Supervised Learning-binary classification
  3. Supervised Learning-multiclass classification
  4. Blackbox Techniques
  5. Supervised Learning-Regression
  6. Time Series Anaysis
  7. Unsupervised Learning
  8. Forecasting
  9. Train-Test Split
  10. Data Partitioning
  1. Data Partition-sequential
  2. Regularization
  3. Hyperparameter Tuning
  4. Regular Monitoring and Logging
  5. Early Stopping
  6. Gradient Clipping
  7. Transfer Learning
  8. Ensemble Methods
  9. Weight Initialization
  10. Cross-Validation
  11. Learning Rate Scheduling
  12. Train-Test Split
  13. Batch Size Selection
  14. Batch Normalization
  15. Data Augmentation
  1. Train-Test Split
  2. Data Partitioning
  3. External Validation
  4. Hyperparameter Tuning
  5. Regularization Techniques
  6. Evaluation Metrics
  7. Model Comparison
  8. Model Interpretability
  9. Cross-Validation
  10. Performance Visualization
  1. Serverless Computing
  2. Data Drift Monitoring
  3. Model Versioning
  4. Model Drift
  5. Prediction Logging
  6. Model Health Monitoring
  7. Web APIs - Flask, FastAPI, etc.
  8. Model Registry
  9. Documentation and API Documentation
  10. Model Serialization
  11. Bias and Fairness Assessment
  12. Feedback Collection
  13. Edge Deployment
  14. Security Considerations
  15. Error Analysis
  16. Containerization
  17. Continuous Integration and Deployment (CI/CD)
  18. Monitoring and Logging
  19. A/B Testing
  20. Performance Metrics
  21. Alerting and Notification
  22. Cloud Deployment
  23. Streamlit
  24. Documentation and Reporting
  25. Concept Drift Detection
  26. Model Retraining and Updating
  27. Model Monitoring and Maintenance
  1. End User Machine
  2. Mobile
ML Workflow Beginner - Architecture
  • Element belongs to model
  • Element not belongs to model
Feature Store

Feature Store
(Online / Offline)

Data Sources

Data Sources

Data Warehouse

Data Warehouse/ Data Lake

Data Pre Processing & Feature Engineering

EDA, Data Pre Processing & Feature Engineering

Model Selection

Model Selection

Model Training & Hyper Parameter Tuning

Model Training & Hyper Parameter Tuning

Model Evaluation

Model Evaluation

Model Deployment

Model Deployment

End User Device

End User Device

Model Registry

Model Registry