Music editing is an important step in music production, with broad applications including game development and film production. Most existing zero-shot text-guided methods rely on pretrained diffusion models and edit through forward-backward diffusion processes. However, these methods often struggle to keep the musical content consistent with the original. Additionally, text instructions alone usually fail to describe the desired music accurately. In this paper, we propose two music editing methods that enhance the consistency between the original and edited music by leveraging score distillation. The first method, SteerMusic, is a coarse-grained zero-shot editing approach based on the delta denoising score. The second method, SteerMusic+, enables fine-grained personalized music editing by manipulating a concept token that represents a user-defined musical style. SteerMusic+ allows music to be edited into user-defined musical styles that cannot be achieved with text instructions alone. Experimental results show that our methods outperform existing approaches in preserving both music content consistency and editing fidelity. User studies further validate that our methods achieve superior music editing quality.
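To make the idea of the delta denoising score more concrete, here is a minimal sketch on a toy problem. It is not the SteerMusic implementation: `toy_eps` is a hypothetical stand-in for a pretrained text-conditioned noise predictor, the prompt "embeddings" are plain vectors, and `dds_edit`, `steps`, `lr`, and the noise-level range are all assumptions chosen for illustration. The sketch only shows the general pattern: the edited latent and the source latent are noised with the same noise, and the difference of the two noise predictions (target prompt minus source prompt) is used as the update direction.

```python
import numpy as np

def toy_eps(z_t, prompt_vec, alpha_bar):
    # Hypothetical stand-in for a pretrained noise predictor (assumption).
    # It behaves as if the clean latent for a prompt were exactly prompt_vec.
    return (z_t - np.sqrt(alpha_bar) * prompt_vec) / np.sqrt(1.0 - alpha_bar)

def dds_edit(z_src, src_vec, tgt_vec, steps=200, lr=0.1, seed=0):
    # Toy delta-denoising-score loop (illustrative parameters, not from the paper).
    rng = np.random.default_rng(seed)
    z = z_src.copy()  # the edit starts from the source latent
    for _ in range(steps):
        alpha_bar = rng.uniform(0.2, 0.8)    # random noise level
        eps = rng.standard_normal(z.shape)   # shared noise for both branches
        a, s = np.sqrt(alpha_bar), np.sqrt(1.0 - alpha_bar)
        z_t = a * z + s * eps                # noised edited latent
        z_src_t = a * z_src + s * eps        # noised source latent
        # Delta score: target-branch prediction minus source-branch prediction.
        grad = toy_eps(z_t, tgt_vec, alpha_bar) - toy_eps(z_src_t, src_vec, alpha_bar)
        z -= lr * grad
    return z

z_src = np.array([1.0, -2.0, 0.5])    # toy "source music" latent
src_vec = np.array([0.0, 0.0, 0.0])   # toy source prompt embedding
tgt_vec = np.array([0.0, 1.0, 0.0])   # toy target prompt embedding
edited = dds_edit(z_src, src_vec, tgt_vec)
```

In this toy setting the shared noise cancels in the difference, so the latent moves only along the direction in which the target prompt differs from the source prompt, and the remaining coordinates of the source stay untouched. That is the intuition behind using a delta score to preserve content; a real diffusion model will not cancel so cleanly.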
This part demonstrates our SteerMusic method on the zero-shot text-guided music editing task.
Source Music | Source Prompt | Target Prompt | SteerMusic | MusicMagus | ZETA | SDEdit | DDIM |
---|---|---|---|---|---|---|---|
This part demonstrates our SteerMusic+ method on the personalized music editing task.
Source Music | Target Concept | SteerMusic+ | DreamSound | Textual Inversion |
---|---|---|---|---|
"A relaxing reggae track with acoustic drums bass keys guitar and female vocals." | [Reggae] | "A relaxing [reggae] track with acoustic drums bass keys guitar and female vocals." | "A relaxing [reggae] track with acoustic drums bass keys guitar and female vocals." | "A relaxing [reggae] track with acoustic drums bass keys guitar and female vocals." |
"A recording of fingerstyle acoustic guitar with a classical atmosphere." | [Sitar] | "A recording of fingerstyle acoustic [Sitar] with a classical atmosphere." | "A recording of fingerstyle acoustic [Sitar] with a classical atmosphere." | "A recording of fingerstyle acoustic [Sitar] with a classical atmosphere." |
"A guitar tutorial with energetic technique and ambient noises." | [Bouzouki] | "A [bouzouki] tutorial with energetic technique and ambient noises." | "A [bouzouki] tutorial with energetic technique and ambient noises." | "A [bouzouki] tutorial with energetic technique and ambient noises." |
"A recording of an intense rhythmic drum battle" | [Bouzouki] | "A recording of an intense rhythmic [bouzouki] battle" | "A recording of an intense rhythmic [bouzouki] battle" | "A recording of an intense rhythmic [bouzouki] battle" |
"A lively country tune featuring banjo acoustic piano violin and upright bass." | [Sitar] | "A lively country tune featuring banjo acoustic [sitar] violin and upright bass." | "A lively country tune featuring banjo acoustic [sitar] violin and upright bass." | "A lively country tune featuring banjo acoustic [sitar] violin and upright bass." |
"A recording of renaissance music with soft wooden percussions and a mellow harmonized flute melody." | [Ocarina] | "A recording of renaissance music with soft wooden percussions and a mellow harmonized [ocarina] melody." | "A recording of renaissance music with soft wooden percussions and a mellow harmonized [ocarina] melody." | "A recording of renaissance music with soft wooden percussions and a mellow harmonized [ocarina] melody." |
"A melancholic pop song with acoustic piano strings electronic bass and female vocals." | [Ocarina] | "A melancholic pop song with acoustic [ocarina] strings electronic bass and female vocals." | "A melancholic pop song with acoustic [ocarina] strings electronic bass and female vocals." | "A melancholic pop song with acoustic [ocarina] strings electronic bass and female vocals." |
"A tranquil complex jazz live performance featuring instrumental improvisation on organ saxophone bass guitar and acoustic drums." | [Morricone] | "A tranquil complex [morricone] live performance featuring instrumental improvisation on organ saxophone bass guitar and acoustic drums." | "A tranquil complex [morricone] live performance featuring instrumental improvisation on organ saxophone bass guitar and acoustic drums." | "A tranquil complex [morricone] live performance featuring instrumental improvisation on organ saxophone bass guitar and acoustic drums." |
"A recording featuring an amateur DJ performance with turntable scratching and electronic drums." | [Reggae] | "A recording featuring an amateur DJ performance with turntable scratching and [reggae] drums." | "A recording featuring an amateur DJ performance with turntable scratching and [reggae] drums." | "A recording featuring an amateur DJ performance with turntable scratching and [reggae] drums." |
"A recording of eclectic rebellious rock music featuring an electric guitar solo keyboard bass guitar and acoustic drums." | [Hiphop] | "A recording of eclectic rebellious [hiphop] music featuring an electric guitar solo keyboard bass guitar and acoustic drums." | "A recording of eclectic rebellious [hiphop] music featuring an electric guitar solo keyboard bass guitar and acoustic drums." | "A recording of eclectic rebellious [hiphop] music featuring an electric guitar solo keyboard bass guitar and acoustic drums." |
"A recording of rock and roll with electric guitar bass guitar acoustic drums and male vocals." | [Hiphop] | "A recording of [hiphop] and roll with electric guitar bass guitar acoustic drums and male vocals." | "A recording of [hiphop] and roll with electric guitar bass guitar acoustic drums and male vocals." | "A recording of [hiphop] and roll with electric guitar bass guitar acoustic drums and male vocals." |
"A mellow rock piece featuring two guitars and a sultry female singer." | [Sarabande] | "A mellow [sarabande] piece featuring two guitars and a sultry female singer." | "A mellow [sarabande] piece featuring two guitars and a sultry female singer." | "A mellow [sarabande] piece featuring two guitars and a sultry female singer." |