3 links
tagged with all of: machine-learning + image-processing
Links
FlexTok is a method for resampling images into 1D token sequences of flexible length, with official implementations and pre-trained models available on GitHub. The repository includes installation instructions, usage examples, and model checkpoints, and it warns that checkpoints should be loaded only from trusted sources because of potential security vulnerabilities. The FlexTok tokenizer and VAE inference can be integrated into other projects using the provided code snippets and Jupyter notebooks.
Thyme approaches image processing by autonomously generating and executing code for complex visual reasoning tasks. Its two-stage training strategy combines supervised fine-tuning and reinforcement learning and uses the GRPO-ATS algorithm to improve performance in high-resolution perception.
Thera is a super-resolution method that incorporates a physical observation model, allowing image enhancement at arbitrary scales. The project, developed by a team from ETH Zurich and the University of Zurich, includes training and evaluation scripts and has gained attention on platforms such as Hacker News. Users can install the required environment and use the pre-trained models to super-resolve images.