Apple has unveiled updates to its on-device and server foundation language models, enhancing generative AI capabilities while prioritizing user privacy. The new models, optimized for Apple silicon, support multiple languages and offer improved efficiency. They incorporate advanced architectures and diverse training data, including image-text pairs, to power intelligent features across Apple's platforms.
RedNote's dots.ocr, a new 3B-parameter OCR model, enables competitive on-device optical character recognition, using Apple's Neural Engine for efficiency. The article outlines the challenges of converting the model from PyTorch to Core ML and details the steps taken to optimize its performance for on-device use. Future parts of the series will cover further integration and optimization strategies.