Quit Emailing Yourself

2 links tagged with all of: deep-learning + image-processing

Links

GitHub - apple/ml-flextok: FlexTok: Resampling Images into 1D Token Sequences of Flexible Length

FlexTok is a method for resampling images into 1D token sequences of flexible length, with official implementations and pre-trained models available on GitHub. The repository includes instructions for installation, usage examples, and model checkpoints, emphasizing the importance of using trusted sources for loading checkpoints due to potential security vulnerabilities. Users can easily integrate the FlexTok tokenizer and VAE inference into their projects using provided code snippets and Jupyter notebooks.

Saved by tldr-importer · Last saved October 29, 2025 · 3 min read

+ flextok + machine-learning image-processing ✓ + tokenization deep-learning ✓

Mask Image Watermarking

MaskMark is a novel framework for image watermarking that offers two variants: MaskMark-D for global and local watermark extraction, and MaskMark-ED for enhanced robustness in localized areas. It employs a masking mechanism during the decoding and encoding stages to improve accuracy and adaptability while maintaining high visual quality. Experimental results demonstrate its superior performance over existing models, requiring significantly less computational cost.

Saved by tldr-importer · Last saved October 29, 2025 · 2 min read

+ watermarking + computer-vision image-processing ✓ + robustness deep-learning ✓