What do you think about Ollama?
Ollama makes running large language models locally as simple as a single terminal command. Supports 100+ models including Llama, Mistral, Gemma, Phi, and DeepSeek. Handles model downloading, quantization, and serving with an OpenAI-compatible API. The de facto standard for local LLM development.