Writing

Technical articles and insights on AI, machine learning, and vector databases.

About My Writing

I write about cutting-edge developments in AI/ML, with a focus on:

  • Vector Databases & Semantic Search - Deep dives into the technology powering modern AI applications
  • Retrieval-Augmented Generation (RAG) - Techniques for building knowledge-grounded AI systems
  • Embedding Models & Fine-tuning - Practical guides for optimizing AI model performance
  • Multimodal AI - Exploring AI systems that work with text, images, and other modalities
  • Developer Relations - Bridging the gap between complex AI research and practical implementation

My articles combine theoretical understanding with hands-on implementation guides, helping developers and researchers build better AI applications.

All Articles

Together Evaluations: Benchmark Models for Your Tasks

Introducing a comprehensive evaluation framework for benchmarking AI models across various tasks and domains.

July 28, 2025
Together AI Evaluation Benchmarks

Back to The Future: Evaluating AI Agents on Predicting Future Events

A comprehensive benchmark for evaluating AI agents on their ability to predict future events and outcomes.

July 17, 2025
Together AI AI Agents Evaluation

From Zero to One: Building An Autonomous Data Scientist Agent

A technical deep dive into building an autonomous AI agent capable of performing data science tasks from scratch.

June 12, 2025
Together AI AI Agents Data Science

Direct Preference Optimization: A Technical Deep Dive

An in-depth exploration of Direct Preference Optimization techniques for improving AI model alignment and performance.

April 17, 2025
Together AI Fine-tuning Optimization

Continued Fine‑tuning of LLMs: A Technical Deep Dive

Comprehensive guide to continued fine-tuning techniques for large language models, covering advanced optimization strategies.

April 17, 2025
Together AI LLMs Fine-tuning

Open Deep Research

Exploring the principles and practices of open research in deep learning and artificial intelligence.

April 16, 2025
Together AI Research Open Source

Long Context Fine‑Tuning: A Technical Deep Dive

Advanced techniques for fine-tuning language models with extended context windows, enabling better long-form understanding.

November 25, 2024
Together AI Long Context Fine-tuning

Multimodal Document RAG with Llama 3.2 Vision and ColQwen2

Building advanced retrieval-augmented generation systems that can process both text and visual document content using state-of-the-art models.

October 08, 2024
Together AI Multimodal RAG

Advanced RAG Techniques

Learn how to improve the individual indexing, retrieval and generation parts of your RAG pipeline!

July 25, 2024
Weaviate rag advanced

OpenAI's Matryoshka Embeddings

How to use OpenAI's embedding models trained with Matryoshka Representation Learning in a vector database like Weaviate

June 18, 2024
Weaviate embeddings openai

Step-by-Step Guide to Choosing the Best Embedding Model for Your Application

How to select an embedding model for your search and retrieval-augmented generation system.

June 04, 2024
Weaviate tutorial embeddings

32x Reduced Memory Usage With Binary Quantization

In-depth technical breakdown of how binary quantization works and how to use it in Weaviate.

April 02, 2024
Weaviate optimization quantization

Accelerating Vector Search up to +40% with Intel's latest Xeon CPU - Emerald Rapids

Boosting Weaviate using SIMD-AVX512, Loop Unrolling and Compiler Optimizations

March 26, 2024
Weaviate performance hardware

Multimodal Retrieval-Augmented Generation (RAG)

Learn how to build Multimodal Retrieval Augmented Generation (MM-RAG) systems that combine text, images, audio, and video. Discover contrastive learning, any-to-any search with vector databases, and practical code examples using Weaviate and OpenAI GPT-4V.

December 05, 2023
Weaviate multimodal rag

How to Reduce Memory Requirements by up to 90%+ using Product Quantization

The details behind how you can compress vectors using PQ with little loss of recall!

September 19, 2023
Weaviate optimization quantization

A Gentle Introduction to Vector Databases

What is a Vector Database? Explanation of core concepts, such as vector embeddings, vector search, and vector indexing

August 01, 2023
Weaviate concepts vector-databases

Multimodal Models

ML Models that can see, read, hear and more!

June 27, 2023
Weaviate multimodal concepts

Private LLM

A discussion on data privacy and privacy-preserving machine learning for LLMs

May 30, 2023
Weaviate privacy llms

How to ChatGPT Plugin

A show-and-tell of how we created the Weaviate Retrieval Plugin for ChatGPT

April 27, 2023
Weaviate tutorial chatgpt

Weaviate Retrieval Plugin

Learn how you can connect Weaviate to ChatGPT to generate customized responses.

April 04, 2023
Weaviate tutorial chatgpt

What are LLMs

A gentle introduction to Large Language Models (LLMs) - how they work and what they learn.

March 23, 2023
Weaviate concepts llms

How AI Creates Art

Machine learning models can create beautiful and novel images. Learn how Diffusion Models work and how you could make use of them.

January 24, 2023
Weaviate concepts ai-art

Vector Embeddings Explained

Get an intuitive understanding of what exactly vector embeddings are, how they're generated, and how they're used in semantic search.

January 16, 2023
Weaviate concepts embeddings

The Details Behind the Sphere Dataset in Weaviate

Learn about the hardware, software and performance metric specifications behind our ~1B object import of the Sphere dataset into Weaviate.

December 27, 2022
Weaviate datasets performance

The Sphere Dataset in Weaviate

Learn how to import and query the Sphere dataset in Weaviate!

December 06, 2022
Weaviate datasets tutorial