Strap in (and harness up)Is the harness more important that the model? A ramble through climbing history and the AI present and future. Random alrightTomcw.xyz·3d ago·11 min readTechFreedomBearingtemplateOllama
Weeknote 52Silly cartoons, random exlorations of a less open sector via the medium of the physical laws of the universe, a trip to cornwall, work on Bearings, sharing Glade a tool for governance, and a bumper li…Tomcw.xyz·6d ago·11 min readTechFreedommini TechFreedom session at TechNExtJames MartinBearing
Having a gander at GooseHarsh Browns [Unofficial]·May 18·4 min readGooselocal and open first principlenative support for LM StudioOllama
Inference EndpointAn inference endpoint is the serving layer for a trained model. After training (or downloading) an LLM, you need infrastructure to accept requests, run the forward pass, and return outputs at scale. T…Sahil Kapoor's Playbook·May 17·3 min readVllmTokenizationOllamaOpenrouter
TokenizationTokenization is the first step in any LLM pipeline: converting raw text into a sequence of integer IDs that the model actually processes. Understanding tokenization helps you reason about context wind…Sahil Kapoor's Playbook·May 17·3 min readLangchainVllmOllamaPrompt Engineering
LoRA (Low-Rank Adaptation)LoRA (Low-Rank Adaptation) is a fine-tuning method introduced by Hu et al. at Microsoft in 2021. Instead of updating all billions of parameters in a large model, LoRA freezes the original weights and …Sahil Kapoor's Playbook·May 17·3 min readPrompt EngineeringSystem PromptRlhfVllm
LangChainLangChain is an open-source framework that provides building blocks for LLM applications. Rather than calling OpenAI's API directly and wiring everything by hand, LangChain gives you composable abstra…Sahil Kapoor's Playbook·May 17·3 min readOllamaSystem PromptMcp Model Context ProtocolPrompt Engineering
OpenRouterA unified API gateway for large language models that lets you call 100+ LLMs from different providers through a single OpenAI-compatible endpoint with automatic fallback and cost routing.Sahil Kapoor's Playbook·May 17·2 min readOllamaVllmInference EndpointLangchain
OpenHandsOpenHands (formerly OpenDevin) is an open-source platform for AI software engineering agents. Unlike Cursor or Windsurf which are IDEs with AI assistance, OpenHands is a platform where AI agents opera…Sahil Kapoor's Playbook·May 17·3 min readCursorWindsurfOllamaMcp Model Context Protocol
GitHub CopilotGitHub Copilot, launched in 2021 and built on OpenAI Codex (later GPT-4), was the first AI pair programmer to reach mainstream adoption. It integrates as an extension into VS Code, JetBrains, Neovim, …Sahil Kapoor's Playbook·May 17·3 min readCursorWindsurfOllamaVllm
vLLMvLLM (Virtual LLM) is an open-source inference engine from UC Berkeley that dramatically increases the throughput of serving large language models on GPU hardware. It was introduced in 2023 with Paged…Sahil Kapoor's Playbook·May 17·3 min readHelmArgocdTraefikNginx
Don't Let the Asphalt Bury the GardenI’ve spent 30 years watching tech cycles come and go, from the first dial-up modems in rural Austria to the mesh networks I’m currently stringing across the Australian bush. Each time a “next big thin…Digital Nomad·Apr 28·9 min readopinionAiOpen SourceDigital Sovereignty