Technology

Sony CSL develops Diff-A-Riff, an AI-powered music generation tool (a "stem generator") designed to assist musicians in creating instrumental accompaniments that naturally complement existing musical material. The system utilizes a Latent Diffusion Model (LDM) to generate high-quality audio tracks at a 48kHz pseudo-stereo resolution.

Artists using Diff-A-Riff can influence the generated accompaniments through practical input controls: either by providing reference audio tracks to indicate musical style, instrumentation, or mood, or by describing desired musical characteristics using textual prompts.

Useful links
Requirements for artistic team
members' skills

Applicants must have a strong interest in:


  • knowledge in music production
  • basic knowledge of generative AI

A background in the following areas would be a plus:

  • a technical framework (like max/python/cpp/others)