Writing | Zain Hasan

Technical deep dives and practical guides on RAG, vector databases, embeddings, LLM fine-tuning, and multimodal AI. Published on the Weaviate and Together AI engineering blogs.

Fine-Tuning Open LLM Judges to Outperform GPT-5.2↗

Open-source LLM judges fine-tuned with DPO can outperform GPT-5.2 at evaluating model outputs. We trained GPT-OSS 120B on 5,400 preference pairs to beat GPT-5.2's accuracy — delivering superior performance at 15x lower cost and 14x faster speeds.

February 02, 2026

Together AI Fine-tuning Evaluation

How to Evaluate and Benchmark Large Language Models (LLMs)↗

A comprehensive guide to evaluating and benchmarking LLMs, covering key metrics, methodologies, and best practices for model selection.

November 04, 2025

Together AI LLMs Evaluation

Dynamic AI Agent Testing for the Real World with Collinear Simulations and Together Evals↗

How to use Collinear Simulations with Together Evals for dynamic, real-world testing of AI agents in production scenarios.

October 28, 2025

Together AI AI Agents Evaluation

Fine-Tuning Platform Upgrades: Larger Models, Longer Contexts, Enhanced Hugging Face Integrations↗

Major upgrades to the Together AI fine-tuning platform including support for larger models, longer context windows, and deeper Hugging Face integration.

September 10, 2025

Together AI Fine-tuning Platform

Together Evaluations: Benchmark Models for Your Tasks↗

Introducing a comprehensive evaluation framework for benchmarking AI models across various tasks and domains.

July 28, 2025

Together AI Evaluation Benchmarks

Back to The Future: Evaluating AI Agents on Predicting Future Events↗

A comprehensive benchmark for evaluating AI agents on their ability to predict future events and outcomes.

July 17, 2025

Together AI AI Agents Evaluation

From Zero to One: Building An Autonomous Data Scientist Agent↗

A technical deep dive into building an autonomous AI agent capable of performing data science tasks from scratch.

June 12, 2025

Together AI AI Agents Data Science

Direct Preference Optimization: A Technical Deep Dive↗

An in-depth exploration of Direct Preference Optimization techniques for improving AI model alignment and performance.

April 17, 2025

Together AI Fine-tuning Optimization

Continued Fine‑tuning of LLMs: A Technical Deep Dive↗

Comprehensive guide to continued fine-tuning techniques for large language models, covering advanced optimization strategies.

April 17, 2025

Together AI LLMs Fine-tuning

Open Deep Research↗

Exploring the principles and practices of open research in deep learning and artificial intelligence.

April 16, 2025

Together AI Research Open Source

Long Context Fine‑Tuning: A Technical Deep Dive↗

Advanced techniques for fine-tuning language models with extended context windows, enabling better long-form understanding.

November 25, 2024

Together AI Long Context Fine-tuning

Multimodal Document RAG with Llama 3.2 Vision and ColQwen2↗

Building advanced retrieval-augmented generation systems that can process both text and visual document content using state-of-the-art models.

October 08, 2024

Together AI Multimodal RAG

Advanced RAG Techniques↗

Learn how to improve the individual indexing, retrieval and generation parts of your RAG pipeline!

July 25, 2024

Weaviate rag advanced

OpenAI's Matryoshka Embeddings↗

How to use OpenAI's embedding models trained with Matryoshka Representation Learning in a vector database like Weaviate

June 18, 2024

Weaviate embeddings openai

Step-by-Step Guide to Choosing the Best Embedding Model for Your Application↗

How to select an embedding model for your search and retrieval-augmented generation system.

June 04, 2024

Weaviate tutorial embeddings

32x Reduced Memory Usage With Binary Quantization↗

In-depth technical breakdown of how binary quantization works and how to use it in Weaviate.

April 02, 2024

Weaviate optimization quantization

Accelerating Vector Search up to +40% with Intel's latest Xeon CPU - Emerald Rapids↗

Boosting Weaviate using SIMD-AVX512, Loop Unrolling and Compiler Optimizations

March 26, 2024

Weaviate performance hardware

Multimodal Retrieval-Augmented Generation (RAG)↗

Learn how to build Multimodal Retrieval Augmented Generation (MM-RAG) systems that combine text, images, audio, and video. Discover contrastive learning, any-to-any search with vector databases, and practical code examples using Weaviate and OpenAI GPT-4V.

December 05, 2023

Weaviate multimodal rag