Saved February 14, 2026
Do you care about this?
Google has launched Magika 1.0, an AI-powered file type detection tool that now supports over 200 file types, up from about 100. The new version features an engine rebuilt in Rust for better performance, a native command-line client, and more accurate detection of specialized file formats.
If you do, here's more
Magika 1.0 is a significant upgrade to Google's AI-driven file type detection system. It supports over 200 file types, roughly double the number in the initial release. Notable additions include Jupyter Notebooks, various machine learning formats, and programming languages such as Swift and Kotlin. The system also distinguishes more accurately between similar formats, such as JSON vs. JSONL and C++ vs. C.
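To see why JSON vs. JSONL is a hard pair to tell apart, here is a minimal rule-based sketch in Python. It is not how Magika works (Magika uses a trained model, and nothing here reflects its internals); the function name `classify_json_like` is made up for illustration.

```python
import json


def classify_json_like(text: str) -> str:
    """Return 'json', 'jsonl', or 'unknown' for a text payload (illustrative heuristic)."""
    stripped = text.strip()
    if not stripped:
        return "unknown"

    # If the whole payload parses as a single JSON document, call it JSON.
    try:
        json.loads(stripped)
        return "json"
    except json.JSONDecodeError:
        pass

    # Otherwise call it JSONL if every non-empty line parses on its own.
    lines = [line for line in stripped.splitlines() if line.strip()]
    try:
        for line in lines:
            json.loads(line)
    except json.JSONDecodeError:
        return "unknown"
    return "jsonl"
```

Note the built-in ambiguity: a JSONL file holding a single record is also a valid JSON document, so this heuristic reports it as JSON. Hand-written rules run into edge cases like this quickly.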
The core of Magika has been completely rebuilt in Rust, improving performance and memory safety. The new command-line tool can process hundreds of files per second on a single core and scale to thousands per second on modern multi-core CPUs. A significant technical challenge was managing a training dataset that exceeded 3TB. For specialized file types where real samples were scarce, generative AI was used to create synthetic training data so that detection stays robust across diverse formats.
Installation commands are provided for Linux, macOS, and Windows. Developers can integrate Magika into applications from several programming languages, and extensive documentation is available for guidance. The community is encouraged to try the tool, report issues, and contribute to its ongoing development.
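As a rough idea of what integration can look like, the sketch below counts detected file types from Python. It assumes the `magika` package is installed and that `Magika().identify_path(path)` returns a result whose `output.label` holds the detected type; check the official documentation for the version you install, since the article summary does not show the API. The helper names `make_magika_labeler` and `count_labels` are hypothetical.

```python
from collections import Counter
from pathlib import Path
from typing import Callable, Iterable


def make_magika_labeler() -> Callable[[Path], str]:
    """Build a path -> label function backed by Magika (assumed API, see above)."""
    # Imported lazily so the rest of this module loads without the package.
    from magika import Magika

    detector = Magika()  # load the model once and reuse it for every file

    def label(path: Path) -> str:
        # Assumption: the result exposes the detected type as output.label.
        return detector.identify_path(path).output.label

    return label


def count_labels(paths: Iterable[Path], labeler: Callable[[Path], str]) -> Counter:
    """Tally how many of the given paths fall under each detected label."""
    return Counter(labeler(path) for path in paths)
```

With the package installed, something like `count_labels((p for p in Path(".").rglob("*") if p.is_file()), make_magika_labeler())` would summarize a directory tree. Passing the labeler in as a parameter keeps the counting logic usable with a stub when Magika is not available.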