Accuracy Audio Samples
Below are samples generated by various text-to-audio models. Use them to compare audio quality and text alignment across models.
Model / Prompt | A railroad crossing rings while a train approaches and blows a horn | In the farming scene, the steady hum of a tractor rumble fills the air, marking the start of a long day's work. | A steady hum emanates from the refrigerator, undisturbed by other activities in the quiet kitchen. | The rhythmic ticking of an analog wall clock marks time above the stove. |
---|---|---|---|---|
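AudioGen | [audio] | [audio] | [audio] | [audio] |
AudioLDM | [audio] | [audio] | [audio] | [audio] |
AudioLDM 2 | [audio] | [audio] | [audio] | [audio] |
Auffusion | [audio] | [audio] | [audio] | [audio] |
MAGNeT | [audio] | [audio] | [audio] | [audio] |
Make-An-Audio | [audio] | [audio] | [audio] | [audio] |
Make-An-Audio 2 | [audio] | [audio] | [audio] | [audio] |
Stable-Audio-Open | [audio] | [audio] | [audio] | [audio] |
Tango | [audio] | [audio] | [audio] | [audio] |
Tango 2 | [audio] | [audio] | [audio] | [audio] |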