Generative AI - The Batch | DeepLearning.AI (Page 2)

Hands strum a guitar covered in labels from major record companies, symbolizing AI music innovation.

Business

Record Labels Back AI-Music Startup: Klay Image emerges from relative obscurity to announce deals with Sony, Warner, and Universal

A music-generation newcomer emerged from stealth mode with licenses to train generative AI models on music controlled by the world’s biggest recording companies.

Bar chart shows HunyuanImage 3.0's performance against Nano Banana and Seedream 4.0, highlighting differences.

Machine Learning Research

Better Images Through Reasoning: HunyuanImage-3.0 uses reinforcement learning and thinking tokens to better understand prompts

A new image generator reasons over prompts to produce outstanding pictures.

Business

AI Music With Major-Label Support: Universal Music Group and music generator Udio struck a deal to settle a lawsuit and build a new platform to remix copyrighted music

Music-generation service Udio will build an AI streaming platform in collaboration with the world’s biggest record label.

Icons for files, pictures, and shopping connect through nodes to a dollar sign, illustrating AI-driven profit pathways.

Business

OpenAI, Meta Diversify AI Product Lines: OpenAI and Meta launch social video apps while ChatGPT adds Pulse and Instant Checkout

OpenAI and Meta, which have been content to offer standalone chatbots or tuck them into existing products, introduced dueling social video networks and other initiatives designed to boost revenue and engagement.

Robots with lighters attend a live concert, underlining AI's role in music creation and performance.

Business

Generating Music, Paying Musicians: Sweden’s STIM built an ecosystem for training AI models on copyrighted music and compensating original artists

A Swedish organization that collects royalties on behalf of songwriters and record companies has formed a technology-legal-business ecosystem designed to allow AI developers to use music legally while compensating publishers of recordings and compositions.

Electron microscope image of bacteriophages with distinct hexagonal heads and tails on a gray background.

Machine Learning Research

AI Generates Viral Genomes: Researchers use genomic language models to create custom viruses

Researchers used AI models to create novel viruses from scratch.

Three AI-generated video clips: a man vaulting over a moving car, a gymnast flipping on a plane wing, and a rabbit ice skating in pink boots.

Machine Learning Research

Mixture of Video Experts: Alibaba’s Wan 2.2 video models adopt a new architecture to sort noisy from less-noisy inputs

The mixture-of-experts approach that has boosted the performance of large language models may do the same for video generation.

Man in suit holding AI book in destroyed office, from viral AI-generated video ad by The Dor Brothers.

Business

AI Video Goes Mainstream: Meta, Google, and other giants slice up text-to-video

Generated video clips are capturing eyeballs in viral videos, ad campaigns, and a Netflix show.

Apple AI models outperform rivals in instruction accuracy and human text evaluations across devices and servers.

Machine Learning Research

Apple Sharpens Its GenAI Profile: Apple updates its on-device and cloud AI models, introduces a new developer API

Apple revamped two vision-language models in a bid to catch up with fast-moving competitors.

Midjourney AI outputs mimic Disney characters, raising copyright concerns in lawsuit by Disney and Universal.

Business

Hollywood Joins AI Copyright Fight: Disney and Universal sue Midjourney, alleging the image generator violates their intellectual property rights

Hollywood studios joined the record companies, publishers, and artists in the fight against companies that have trained AI models on their copyrighted works.

The FLUX.1 Kontext family of image generators from Black Forest Labs edits images to remove or add objects, apply art styles, and extract details.

Machine Learning Research

More Consistent Characters and Styles: Black Forest Labs Launches FLUX.1 Kontext for Generating and Alterating Images with Consistent Details

Same character, new background, new action. That’s the focus of the latest text-to-image models from Germany’s Black Forest Labs.

Duolingo owl mascots dressed in cultural costumes, representing global languages and cultures.

Business

Machine Translation in Action: Duolingo turns to AI translation to expand its most popular courses to all 28 user languages

AI is bringing a massive boost in productivity to Duolingo, maker of the most popular app for learning languages.