Trung BìnhAI Engineering iconAI Engineering

AI Engineer vs ML Engineer: khác biệt, skill set, khi nào cần từng vai trò?

Machine Learning Engineer (truyền thống):
- Build/train model từ data: feature engineering, train classical ML (XGBoost, scikit) hoặc deep learning (PyTorch, TF).
- Responsible cho pipeline MLOps: data ingestion, feature store, model registry, training pipeline, serving.
- Cần kiến thức sâu: linear algebra, calculus, statistics, optimization, architecture deep learning.
- Workflow: vấn đề business → data collection → feature engineering → model selection → train/eval → deploy → monitor drift → retrain.
- Artifact chính: model weights + training pipeline.

AI Engineer (post-LLM, emerged 2023+):
- Build app dùng pre-trained foundation model (LLM, VLM, diffusion) từ provider hoặc open-source.
- Không train model from scratch; focus vào prompt engineering, RAG, fine-tuning (PEFT), agents, eval, system integration.
- Cần: strong software engineering + practical knowledge về LLM behavior, prompt craft, evaluation, cost/latency optimization, guardrails.
- Workflow: business problem → prompt + RAG design → eval → deploy → monitor quality/cost → iterate prompt.
- Artifact chính: prompt + retrieval pipeline + agent orchestration, model là commodity.

Overlap: cả hai dùng python/pytorch, cần hiểu metrics và deployment. Nhiều MLE chuyển thành AIE.

Khác biệt cốt lõi:

ML EngineerAI Engineer
Model originTrain from scratch / domainUse pre-trained foundation
Data needLarge labeled datasetSmall eval set + RAG corpus
Core skillMath, ML theory, MLOpsPrompt, RAG, system design, LLM ops
DebuggingFeature, training dynamicsPrompt, retrieval, hallucination
CostTraining computeInference tokens
Failure modeModel accuracy drop, driftHallucination, jailbreak, cost spike
Typical stackPyTorch, Kubeflow, MLflowLangChain/LlamaIndex, vector DB, LLM API

Khi cần MLE:
- Task cần model chuyên biệt chưa có pre-trained (fraud, recommendation, forecasting, CTR).
- Dataset proprietary lớn, cần custom model.
- Regulated domain cần interpretability (logistic regression > black box).
- Edge/embedded cần model nhỏ custom.
- Classical tasks: time series, tabular, CV ngành hẹp.

Khi cần AIE:
- Task NLP/text generation/chatbot/Q&A/summarization → RAG + LLM.
- Code generation/review, doc processing, search.
- Agent tự động hoá workflow.
- Multi-modal (VLM, ảnh → text).
- Fast prototyping business feature AI — time to market quan trọng.

Org thực tế:

  • Start-up / mid-size product: 80-90% nhu cầu giờ là AIE (dùng API). Thuê MLE khi cần custom model.
  • Tech giant / research lab: cần cả hai, phân tầng. Research scientist train foundation → ML engineer productionize → AI engineer build feature trên đó.
  • Team role thường gộp ở công ty nhỏ — "ML Engineer" làm cả hai, gọi tên "AI/ML Engineer" phổ biến.

Skill path để transition ML → AI engineer (phổ biến 2024-2025):
1. Hiểu transformer architecture ở mức đủ (không cần train from scratch).
2. Thành thạo prompt engineering, few-shot, CoT.
3. Build RAG end-to-end (chunking, embedding, vector DB, reranker).
4. Fine-tune với PEFT (LoRA).
5. Eval framework (RAGAS, LLM-judge).
6. LLMOps (LiteLLM, Langfuse, cost/latency opt).
7. Agent pattern (ReAct, tool use, MCP).
8. System design AI app end-to-end.

Thách thức ở từng vai trò:
- MLE: model drift, retraining cost, distribution shift.
- AIE: prompt brittleness, hallucination, provider dependency, cost unpredictability, jailbreak.

Xu hướng 2025+: ranh giới mờ dần. Nhiều AIE fine-tune model (LoRA); nhiều MLE phải serve LLM. Công ty tuyển "AI Engineer" thường mong cả hai.

Xem toàn bộ AI Engineering cùng filter theo level & chủ đề con.

Mở danh sách AI Engineering