Summary: The big AI events of October
Posted by nh_local@reddit | LocalLLaMA | View on Reddit | 26 comments
* **Flux 1.1 Pro** is released, showcasing advanced capabilities for image creation.
* Meta unveils **Movie Gen**, a new AI model that generates videos, images, and audio from text input.
* Pika introduces **Video Model 1.5** along with "Pika Effects".
* Adobe announces its video creation model, **Firefly Video**.
* Startup Rhymes AI releases **Aria**, an open-source, multimodal model exhibiting capabilities similar to comparably sized proprietary models.
* Meta releases an open-source speech-to-speech language model named **Meta Spirit LM**.
* Mistral AI introduces **Ministral**, a new model available in 3B and 8B parameter sizes.
* **Janus AI**, a multimodal language model capable of recognizing and generating both text and images, is released as open source by DeepSeek-AI.
* Google DeepMind and MIT unveil **Fluid**, a text-to-image generation model with industry-leading performance at a scale of 10.5B parameters.
* **Stable Diffusion 3.5** is released in three sizes as open source.
* Anthropic launches **Claude 3.5 Sonnet New**, demonstrating significant advancements in specific areas over its previous version, and announces **Claude 3.5 Haiku**.
26 Comments
nh_local@reddit (OP)
sunshinecheung@reddit
nh_local@reddit (OP)
Ok-Parsnip-4826@reddit
nh_local@reddit (OP)
Ok-Parsnip-4826@reddit
nh_local@reddit (OP)
-p-e-w-@reddit
Xanjis@reddit
-p-e-w-@reddit
Xanjis@reddit
-p-e-w-@reddit
Calandiel@reddit
tessellation@reddit
dddimish@reddit
nh_local@reddit (OP)
InvestigatorHefty799@reddit
nh_local@reddit (OP)
-p-e-w-@reddit
nh_local@reddit (OP)
Intelligent_Jello344@reddit
Robert__Sinclair@reddit
Everlier@reddit
nh_local@reddit (OP)
Ok-Succotash-7945@reddit
a_beautiful_rhind@reddit