Click any tag below to further narrow down your results
Links
The article argues for the shift from generalized software solutions to bespoke software tailored for specific company needs. It discusses the limitations of off-the-shelf solutions and highlights the potential of LLMs to enable smaller companies to create custom tools efficiently. The author emphasizes the importance of cutting legacy systems to improve software integration and management.
This article examines a dataset of over 100 trillion tokens from the OpenRouter platform to understand how large language models (LLMs) are used in practice. It highlights trends in model adoption, task categories, and user retention patterns, revealing a shift towards more complex interactions and the impact of early user engagement.
ShinkaEvolve is an innovative evolutionary code optimization framework that utilizes large language models (LLMs) to discover new algorithms with unprecedented sample efficiency. It has achieved state-of-the-art solutions in various domains, including Circle Packing and agent design, by significantly reducing the number of samples needed for effective program evolution. The framework is open-sourced to empower researchers and engineers in their scientific discoveries and development efforts.
Character.AI has open-sourced pipeling-sft, a scalable framework designed for fine-tuning large-scale MoE LLMs like DeepSeek V3. This framework addresses challenges in training efficiency and stability, integrating multi-level parallelism and supporting various precision formats, while facilitating seamless HuggingFace integration for researchers.