The convergence of Artificial Intelligence (AI) with DevOps, DataOps, and MLOps has transformed the software development lifecycle, enabling scalable, automated, and intelligent systems. This paper explores the transition from traditional DevOps to MLOps, emphasizing the integration of machine learning workflows into continuous integration, deployment, and training pipelines. We present a practical framework for implementing MLOps using tools such as MLflow, Airflow, and Kubernetes, and address challenges like overfitting, underfitting, and model drift. The proposed architecture leverages Docker and ONNX for model packaging and deployment, ensuring reproducibility and cross-platform compatibility. Through real-world examples and pipeline automation strategies, we demonstrate how MLOps enhances model reliability, governance, and performance monitoring in dynamic environments. This study contributes to the growing body of knowledge on AI-driven DevOps by offering actionable insights for researchers and practitioners aiming to build robust ML systems. Build an Apache Airflow pipeline to load, train, and evaluate a ML model, store it, and use it for inferencing by deploying the model with a sleek Streamlit UI, Docker, and auto-scale it with Kubernetes as container orchestration tool. Techniques for implementing and automating continuous integration (CI), continuous delivery (CD), and continuous training (CT) for machine learning (ML) systems. This document applies primarily to predictive AI systems.
| Published in | American Journal of Artificial Intelligence (Volume 9, Issue 2) |
| DOI | 10.11648/j.ajai.20250902.29 |
| Page(s) | 297-309 |
| Creative Commons |
This is an Open Access article, distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution and reproduction in any medium or format, provided the original work is properly cited. |
| Copyright |
Copyright © The Author(s), 2025. Published by Science Publishing Group |
Artificial Intelligence, DevOps, MLOps, Overfitting, Docker, Kubernetes, DataOps, Machine Learning Lifecycle
Aspect | DevOps | MLOps |
|---|---|---|
Main Product | Application source code | ML model + data |
Testing | Unit test, integration | Data validation, model evaluation |
Deployment | CI/CD | CI/CD + model registry + feature store |
Monitoring | Logs, system performance | Model drift, data quality |
Versioning | Source code | Source code + data + model |
Method | Description |
|---|---|
Increase training data | Helps model learn more general patterns |
Regularization (L1/L2) | Adds constraints to reduce model complexity |
Dropout (for neural nets) | Randomly removes nodes during training |
Early stopping | Stops training when test performance declines |
Simplify model | Use fewer parameters or simpler architecture |
Cross-validation | Evaluate model on multiple test sets |
MLOps Component | Role in Overfitting Mitigation |
|---|---|
Extended CI/CT/CD | Automatically retrain with new data |
Model testing | Evaluate stability across multiple test sets |
Model monitoring | Detect drift and alert on accuracy drop |
Model packaging | Ensure deployed model is well-tested |
Feature engineering | Reduce noise and improve generalization |
Sign | Meaning |
|---|---|
Low accuracy on both train/test | Model fails to learn patterns |
High loss that doesn’t decrease | Training is insufficient |
No improvement with more data | Model lacks capacity to learn |
Method | Description |
|---|---|
Increase model complexity | Add layers or nodes |
Train longer | More epochs or lower learning rate |
Add features | Create meaningful input variables |
Reduce regularization | Avoid overly simplifying the model |
Check preprocessing | Ensure no important data is lost |
Method | Characteristics |
|---|---|
One-hot | Binary vector, no semantic meaning |
Word2Vec | Learns meaning from context |
GloVe | Based on global corpus statistics |
FastText | Considers subword structure |
BERT/Transformer | Contextual embeddings |
Metric | Description | Example Benchmark Target |
|---|---|---|
Model Deployment Latency | Time from model validation to production deployment | < 30 seconds (Kubernetes-based) |
Retraining Responsiveness | Time to detect drift and trigger retraining | < 5 minutes (Prometheus alert) |
Reproducibility Score | Percentage of experiments that can be re-run with identical results | > 95% (MLflow tracking) |
Monitoring Coverage | Number of metrics tracked (accuracy, latency, fairness, drift, etc.) | ≥ 5 core metrics |
Criteria | Manual Deployment | Traditional DevOps | Proposed MLOps Pipeline |
|---|---|---|---|
Deployment Latency | 1–3 days (manual packaging) | 2–6 hours (CI/CD for code only) | < 30 seconds (Docker + Kubernetes) |
Retraining Trigger Time | Manual, ad-hoc | Not supported | < 5 minutes (Airflow + Prometheus) |
Reproducibility | Low (no version control) | Medium (code only) | High (>95%, MLflow registry) |
Monitoring Coverage | Basic (CPU, RAM) | System-level only | Full (accuracy, drift, fairness) |
Model Versioning | Manual file naming | Git-based (code only) | MLflow + ONNX (code + data + model) |
Scalability | Limited (single machine) | Moderate (CI/CD for apps) | High (Kubernetes auto-scaling) |
Rollback Capability | Manual restore | Git revert (code only) | MLflow rollback + container image |
Compliance & Auditability | Poor | Limited | Strong (metadata + lineage logs) |
AI | Artificial Intelligence |
ML | Machine Learning |
MLOps | Machine Learning Operations |
CI/CD/CT | Continuous Integration / Continuous Deployment / Continuous Training |
ONNX | Open Neural Network Exchange |
DAG | Directed Acyclic Graph |
PVC | Persistent Volume Claim |
API | Application Programming Interface |
NLP | Natural Language Processing |
Aspect | DevOps | MLOps |
|---|---|---|
Focus | Software development and deployment | Machine learning model lifecycle |
Artifacts | Application code, binaries | Data, models, metrics, code |
Versioning | Source code and build artifacts | Code, datasets, model versions |
Testing | Unit, integration, and system tests | Data validation, model evaluation, bias detection |
CI/CD | Continuous integration and deployment of software | Continuous training, validation, and deployment of models |
Monitoring | Application performance, uptime | Model drift, prediction accuracy, data quality |
Tools | Jenkins, Docker, Kubernetes | MLflow, Airflow, TensorBoard, Kubeflow |
Challenges | Deployment speed, rollback, scaling | Data dependency, reproducibility, model explainability |
| [1] |
Codezup, “Building a Robust MLOps Pipeline: A Step-by-Step Guide,” May 3, 2025. Available:
https://codezup.com/building-robust-mlops-pipeline-step-by-step-guide/ |
| [2] | Qu Xiangjie, “Build an end-to-end MLOps pipeline with Air-flow, Streamlit, Docker, and Kubernetes,” Oct. 4, 2025. Available: |
| [3] | M. Zaharia et al., “MLflow: Accelerating the machine learning lifecycle,” 2020. Available: |
| [4] | Google Cloud, “MLOps: Continuous delivery and automation pipelines in machine learning,” 2023. Available: |
| [5] | V. Lakshmanan, Practical MLOps: Operationalizing machine learning models, Sebastopol, CA: O'Reilly Media, 2022. |
| [6] | Seldon, “Monitoring and managing ML models in production,” 2023. Available: |
| [7] | MLflow, “MLflow Documentation,” 2020. Available: |
| [8] | Airbnb Engineering, “Automating ML Pipelines for Real-Time Recommendations,” 2023. Available: |
| [9] | Arxiv, “Multivocal Review on MLOps Tooling Fragmentation,” 2024. Available: |
| [10] | Facebook Prophet, “Forecasting at Scale,” 2022. Available: |
| [11] | Hugging Face, “Transformers Documentation,” 2023. Available: |
| [12] | OpenAI Gym, “Toolkit for Developing and Comparing Reinforcement Learning Algorithms,” 2022. Available: |
| [13] | Philips, “AI-Powered Diagnostic Imaging with MLOps,” 2023. Available: |
| [14] | Ray Project, “Distributed Hyperparameter Tuning with Ray Tune,” 2023. Available: |
| [15] | Unity ML-Agents Toolkit, “Training Intelligent Agents,” 2022. Available: |
APA Style
Minh, T. Q., Lan, N. T., Phuong, L. T., Cuong, N. C., Tam, D. C. (2025). Building Scalable MLOps Pipelines with DevOps Principles and Open-Source Tools for AI Deployment. American Journal of Artificial Intelligence, 9(2), 297-309. https://doi.org/10.11648/j.ajai.20250902.29
ACS Style
Minh, T. Q.; Lan, N. T.; Phuong, L. T.; Cuong, N. C.; Tam, D. C. Building Scalable MLOps Pipelines with DevOps Principles and Open-Source Tools for AI Deployment. Am. J. Artif. Intell. 2025, 9(2), 297-309. doi: 10.11648/j.ajai.20250902.29
AMA Style
Minh TQ, Lan NT, Phuong LT, Cuong NC, Tam DC. Building Scalable MLOps Pipelines with DevOps Principles and Open-Source Tools for AI Deployment. Am J Artif Intell. 2025;9(2):297-309. doi: 10.11648/j.ajai.20250902.29
@article{10.11648/j.ajai.20250902.29,
author = {Trinh Quang Minh and Ngo Thi Lan and Lam Tan Phuong and Nguyen Chi Cuong and Do Chi Tam},
title = {Building Scalable MLOps Pipelines with DevOps Principles and Open-Source Tools for AI Deployment},
journal = {American Journal of Artificial Intelligence},
volume = {9},
number = {2},
pages = {297-309},
doi = {10.11648/j.ajai.20250902.29},
url = {https://doi.org/10.11648/j.ajai.20250902.29},
eprint = {https://article.sciencepublishinggroup.com/pdf/10.11648.j.ajai.20250902.29},
abstract = {The convergence of Artificial Intelligence (AI) with DevOps, DataOps, and MLOps has transformed the software development lifecycle, enabling scalable, automated, and intelligent systems. This paper explores the transition from traditional DevOps to MLOps, emphasizing the integration of machine learning workflows into continuous integration, deployment, and training pipelines. We present a practical framework for implementing MLOps using tools such as MLflow, Airflow, and Kubernetes, and address challenges like overfitting, underfitting, and model drift. The proposed architecture leverages Docker and ONNX for model packaging and deployment, ensuring reproducibility and cross-platform compatibility. Through real-world examples and pipeline automation strategies, we demonstrate how MLOps enhances model reliability, governance, and performance monitoring in dynamic environments. This study contributes to the growing body of knowledge on AI-driven DevOps by offering actionable insights for researchers and practitioners aiming to build robust ML systems. Build an Apache Airflow pipeline to load, train, and evaluate a ML model, store it, and use it for inferencing by deploying the model with a sleek Streamlit UI, Docker, and auto-scale it with Kubernetes as container orchestration tool. Techniques for implementing and automating continuous integration (CI), continuous delivery (CD), and continuous training (CT) for machine learning (ML) systems. This document applies primarily to predictive AI systems.},
year = {2025}
}
TY - JOUR T1 - Building Scalable MLOps Pipelines with DevOps Principles and Open-Source Tools for AI Deployment AU - Trinh Quang Minh AU - Ngo Thi Lan AU - Lam Tan Phuong AU - Nguyen Chi Cuong AU - Do Chi Tam Y1 - 2025/12/11 PY - 2025 N1 - https://doi.org/10.11648/j.ajai.20250902.29 DO - 10.11648/j.ajai.20250902.29 T2 - American Journal of Artificial Intelligence JF - American Journal of Artificial Intelligence JO - American Journal of Artificial Intelligence SP - 297 EP - 309 PB - Science Publishing Group SN - 2639-9733 UR - https://doi.org/10.11648/j.ajai.20250902.29 AB - The convergence of Artificial Intelligence (AI) with DevOps, DataOps, and MLOps has transformed the software development lifecycle, enabling scalable, automated, and intelligent systems. This paper explores the transition from traditional DevOps to MLOps, emphasizing the integration of machine learning workflows into continuous integration, deployment, and training pipelines. We present a practical framework for implementing MLOps using tools such as MLflow, Airflow, and Kubernetes, and address challenges like overfitting, underfitting, and model drift. The proposed architecture leverages Docker and ONNX for model packaging and deployment, ensuring reproducibility and cross-platform compatibility. Through real-world examples and pipeline automation strategies, we demonstrate how MLOps enhances model reliability, governance, and performance monitoring in dynamic environments. This study contributes to the growing body of knowledge on AI-driven DevOps by offering actionable insights for researchers and practitioners aiming to build robust ML systems. Build an Apache Airflow pipeline to load, train, and evaluate a ML model, store it, and use it for inferencing by deploying the model with a sleek Streamlit UI, Docker, and auto-scale it with Kubernetes as container orchestration tool. Techniques for implementing and automating continuous integration (CI), continuous delivery (CD), and continuous training (CT) for machine learning (ML) systems. This document applies primarily to predictive AI systems. VL - 9 IS - 2 ER -