Summary: The big AI events of October

Posted by nh_local@reddit | LocalLLaMA

* **Flux 1.1 Pro** is released, showcasing advanced capabilities for image creation.
* Meta unveils **Movie Gen**, a new AI model that generates videos, images, and audio from text input.
* Pika introduces **Video Model 1.5** along with "Pika Effects".
* Adobe announces its video creation model, **Firefly Video**.
* Startup Rhymes AI releases **Aria**, an open-source, multimodal model exhibiting capabilities similar to comparably sized proprietary models.
* Meta releases an open-source speech-to-speech language model named **Meta Spirit LM**.
* Mistral AI introduces **Ministral**, a new model available in 3B and 8B parameter sizes.
* **Janus AI**, a multimodal language model capable of recognizing and generating both text and images, is released as open source by DeepSeek-AI.
* Google DeepMind and MIT unveil **Fluid**, a text-to-image generation model with industry-leading performance at a scale of 10.5B parameters.
* **Stable Diffusion 3.5** is released in three sizes as open source.
* Anthropic launches **Claude 3.5 Sonnet New**, demonstrating significant advancements in specific areas over its previous version, and announces **Claude 3.5 Haiku**.