Featured Researcher • Gen-Verse LatentMAS

Production-Grade AI & MLOps Systems

Arifuzzaman Joy — AI/ML Engineer & MLOps Specialist

Delivering scalable AI systems with measurable business impact. Specializing in Generative AI, Multi-Agent Systems, and cloud-native ML infrastructure. 6+ years of turning research into production systems.

7 Publications
6+ Years Experience
15+ Production Systems

Case Studies

Selected Projects

Production-grade AI systems with proven performance improvements and cost savings

2025

AI Calling Agent Platform

Real-time voice conversation platform with SIP/WebRTC telephony integration. Achieved sub-500 ms latency with emotionally expressive speech synthesis.

LiveKit • GPT-4 • WebRTC • RunPod

2025

Multi-GPU Video Generation

Distributed inference pipeline for video generation (text-to-video, image-to-video, speech-to-video). Achieved a 3× throughput increase using FSDP on serverless GPU clusters.

PyTorch • FSDP • Modal • DiT

2024

Custom LoRA Training Pipeline

Self-hosted Flux.1 Dev with custom LoRA fine-tuning infrastructure. Delivered an 80% cost reduction for client photography workflows.

Flux • LoRA • Gradio • Modal

2024

Enterprise RAG System

Multi-modal retrieval-augmented generation with vector search and semantic chunking. Achieved a 40% accuracy improvement over the baseline implementation.

LangChain • Pinecone • GPT-4 • FastAPI

2024

Voice-Pro: Speech Processing Platform

Web application for speech recognition, translation, and voice cloning across 100+ languages. Supports YouTube processing and real-time translation.

Whisper • F5-TTS • Deep-Translator • Python

2024

Medical Imaging with Transformers

Brain tumor classification and segmentation using ConvNeXt V2 and SegFormer. Achieved 99.6% diagnostic accuracy on the evaluation dataset.

PyTorch • ConvNeXt V2 • SegFormer • Transformers

Career

Professional Experience

Building production AI systems and conducting applied research

AI & Machine Learning Engineer

Freelance — Multiple Clients

2023 — Present

  • Develop and deploy ML/AI models for multi-modal tasks, including image generation, video synthesis, NLP, and voice AI
  • Design and implement serverless GPU infrastructure with Docker and Kubernetes, achieving a cost reduction of more than 60%
  • Build production RAG systems and multi-agent frameworks with measurable performance improvements

Research Assistant

Rajshahi University — Solar Lab / AI Lab

Mar 2022 — May 2023

  • Conducted research on renewable energy (solar cells) and speech processing using ML/DL techniques
  • Applied machine learning to analyze simulation data and improve photovoltaic performance
  • Published 4 peer-reviewed papers in Q1 journals with impact factors up to 7.1

Technical Expertise

Skills & Technologies

Full-stack ML engineering with production-grade tools and frameworks

AI & Machine Learning

Generative AI • LLMs • Multi-Agent Systems • Deep Learning • NLP • Computer Vision • RAG

MLOps & Cloud

Docker Kubernetes CI/CD AWS Azure ML Monitoring Logging

Frameworks & Tools

PyTorch • TensorFlow • HuggingFace • LangChain • vLLM • FastAPI • Gradio

Serverless GPU

RunPod • Modal • Replicate • Lambda Labs • FSDP • DeepSpeed

Languages

Python • SQL • JavaScript • Bash • MATLAB

LLMs & Models

GPT-4 • Claude • Llama • Qwen • Flux • LoRA/QLoRA

Research

Publications

7 peer-reviewed publications • 4 Q1 journals (IF up to 7.1) • Google Scholar Profile

Unleashing the Power of Open-Source Transformers in Medical Imaging

International Journal of Advanced Computer Science and Applications, 2024

SCI-Indexed • 99.6% Accuracy

Numerical prediction on the photovoltaic performance of CZTS-based thin film solar cell

Nano Select, 2023

Scopus

Spectrum estimation for voiced speech using average weighted linear prediction

2024

Speech Processing

Enhancement of Bone Conducted Speech Using Deep Transfer Learning

2024

Deep Learning

Contact

Get in Touch

Open to AI/ML Engineering roles, MLOps consulting, and collaborative research projects

Phone

+880 1521 417908