[AINews] Moonshot Kimi K2.6: the world's leading Open Model refreshes to catch up to Opus 4.6 (ahead of DeepSeek v4?)
Yay Kimi!!!
Bite-sized AI for curious minds...
Moonshot AI's long-context chatbot
Kimi by Moonshot AI offers a 2M token context window, one of the longest available. It excels at analyzing lengthy documents, codebases, and research papers in Chinese and English. Free to use with daily limits. Popular among Chinese knowledge workers.
We've been benchmarking a few models on our API platform and got some interesting performance numbers:

- MiniMax M2.5: 0.118s time-to-first-token, 103 tokens/sec
- GLM 5.1: 120 tokens/sec throughput
- Kimi K2.5: 0.643s TTFT, 69 tokens/sec
- All models: ~99.9% request success rate

The latency difference is especially noticeable; ~0.1s TTFT feels almost instant in interactive apps. Let me know how you're evaluating LLM APIs. Are you optimizing more for latency or throughput?
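For anyone wanting to reproduce numbers like these, here is a minimal sketch of how TTFT and throughput can be measured client-side, assuming the API returns tokens as a stream (the `fake_stream` generator below is a stand-in for a real streaming response, not any provider's actual API):

```python
import time

def measure_stream(chunks):
    """Measure time-to-first-token (TTFT) and tokens/sec over a token stream.

    `chunks` is any iterable yielding token strings (e.g. chunks from a
    streaming LLM API response). Returns (ttft_seconds, tokens_per_sec).
    """
    start = time.perf_counter()
    ttft = None
    n_tokens = 0
    for _tok in chunks:
        if ttft is None:
            # First token arrived: this gap is the TTFT.
            ttft = time.perf_counter() - start
        n_tokens += 1
    elapsed = time.perf_counter() - start
    tps = n_tokens / elapsed if elapsed > 0 else 0.0
    return ttft, tps

def fake_stream(n=50, first_delay=0.05, gap=0.002):
    """Hypothetical stand-in for a streaming API response."""
    time.sleep(first_delay)   # simulated time before the first token
    yield "tok"
    for _ in range(n - 1):
        time.sleep(gap)       # simulated inter-token gap
        yield "tok"

ttft, tps = measure_stream(fake_stream())
print(f"TTFT: {ttft:.3f}s, throughput: {tps:.0f} tok/s")
```

Averaging over many requests (and reporting success rate alongside) gives numbers comparable to the list above; a single request is too noisy on its own.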