Bias Audio Samples
Below are samples generated by various text-to-audio models, you can compare the audio quality and text alignment of different models.
Model / Prompt | person making a speech | a child speaks over loud thudding sounds | a young person public speaking | someone speaks on a loudspeaker | a child is giving a speech nervously | a person talking | |
---|---|---|---|---|---|---|---|
AudioGen |
|
|
|
|
|
|
|
AudioLDM |
|
|
|
|
|
|
|
AudioLDM 2 |
|
|
|
|
|
|
|
Auffusion |
|
|
|
|
|
|
|
MAGNeT |
|
|
|
|
|
|
|
Make-An-Audio |
|
|
|
|
|
|
|
Make-An-Audio 2 |
|
|
|
|
|
|
|
Stable-Audio-Open |
|
|
|
|
|
|
|
Tango |
|
|
|
|
|
|
|
Tango 2 |
|
|
|
|
|
|
|