Hi, I'm Martin 👋

Software & Cloud Engineer. I help businesses reduce costs and improve the performance, latency, and stability of their production systems.

Services

Cloud Infrastructure Performance Review

What I Offer

Infrastructure and system architecture review
Performance bottleneck analysis
Resource usage and cost review
Observability and monitoring assessment
Actionable recommendations with clear priorities

How I Work

I work methodically, starting with visibility and data before touching anything.

Instead of trying to fix everything at once, I identify the few issues that create most of the cost, latency, or instability. The focus is on understanding where the system is inefficient and why.

Most teams get clear improvement directions within the first review, without committing to long-term work or risky changes.

I communicate findings clearly and stay available for questions during the review period.

€500 per review

One-time engagement

Typical turnaround: 5–7 days

Consulting Call

Have a specific performance or cloud question? I'll review your situation and explain what's likely going wrong, what matters, and what doesn't.

Custom Engagements

Need something different? I also offer follow-up implementation help, deeper reviews, or ongoing advisory work depending on your needs and constraints.

Work Experience

Artifimo

May 2025 - Present

Founded a bespoke AI automation & integration agency that helps businesses cut costs and get ahead with AI. Client projects have ranged from deep AI integrations built with LangChain and transformers.js to highly optimized, human-like AI voice support agents.

Imagga

October 2024 - April 2025

GenAI Researcher

Focused on virtual try-on (VTON) technology using Python and generative AI models. Gained hands-on experience building and optimizing image generation models within the ComfyUI framework. Contributed to AI-driven image generation solutions, with an emphasis on model fine-tuning, testing, and deployment for commercial applications.

Imagga

June 2024 - October 2024

Explored image synthesis techniques and contributed to virtual try-on projects as a Generative AI Intern. Gained experience with various generative AI tools and libraries, experimenting with model configurations and data processing. Collaborated with the team on testing and evaluating model outputs, which deepened my understanding of AI applications in e-commerce.

Contributions

Previous Work

Results that speak for themselves

From AI platforms to mobile apps, I've helped teams achieve measurable improvements in performance, scalability, and cost efficiency.

Artifimo

Built a complete AI automation platform from scratch. Achieved sub-200ms response times on LLM orchestration and 99.9% uptime across all client deployments.

Actiko

Implemented intelligent caching and RAG optimization that reduced API costs by 65% while improving response quality scores by 40%.

VOWCE

Optimized speech-to-text pipeline achieving real-time transcription with 95% accuracy. Reduced app bundle size by 35% through code splitting.

JobCue

Architected scalable interview processing system handling 1,000+ concurrent sessions. Reduced infrastructure costs by 50% through smart resource allocation.

Postmate

Built high-throughput content generation pipeline. Implemented queue workers that process 10,000+ posts daily with zero downtime.

Sentimenty

Delivered enterprise-grade feedback system with real-time analytics. Achieved 60ms average page load through edge caching and optimization.

CloseUp.Pics

Engineered GPU inference pipeline with 3x faster image generation. Built monitoring stack that reduced debugging time by 80%.

IrreglY

Developed scalable mobile reporting system serving thousands of users. Implemented efficient geospatial queries with sub-100ms response times.

Certifications

Professional Certifications

I hold the following certifications, demonstrating my expertise and commitment to continuous learning.

Building AI
University of Helsinki
August 2025

Career Essentials in Generative AI
Microsoft
December 2024

Recent Posts

FastAPI-MCP: A Guide to Using MCP With FastAPI

Learn how to integrate FastAPI with Model Context Protocol (MCP) to instantly turn your API endpoints into agent-ready tools and workflows.

August 22, 2025

Reassessing "Zero to One" in the Age of Advanced AI

AI is advancing faster than the previous decade anticipated.

February 27, 2025

Let's optimize together

Ready to improve your cloud?

Get a comprehensive infrastructure review and actionable recommendations to reduce costs, improve performance, and scale with confidence.

Need to contact me via e-mail? Write to: m [at] martinkostov [dot] me