Click any tag below to further narrow down your results
Links
Qwen has launched Qwen3-Max-Thinking, a model aimed at solving difficult math and coding problems. It features a large context window and can perform complex reasoning tasks while integrating tool use and web searches. Developers can access it through Alibaba Cloud's Model Studio for both detailed analysis and quicker responses.
ConciseHint is a proposed framework designed to enhance reasoning efficiency by providing continuous concise hints during the token generation process. It incorporates both manually designed and learned textual hints to optimize model performance. The article includes specific code snippets for setting up the framework using Python and relevant libraries.
Kimi-VL is an open-source Mixture-of-Experts vision-language model that excels in multimodal reasoning and long-context understanding with only 2.8B activated parameters. It demonstrates superior performance in various tasks such as multi-turn interactions, video comprehension, and mathematical reasoning, competing effectively with larger models while maintaining efficiency. The latest variant, Kimi-VL-A3B-Thinking-2506, enhances reasoning and visual perception capabilities, achieving state-of-the-art results in several benchmarks.