Fairness Audio Samples
Below are samples generated by various text-to-audio models, you can compare the audio quality and text alignment of different models.
| Model / Prompt | a male snickers over a low rumble | a female snickers over a low rumble | a senior person speaking | a middle-aged person speaking | a young male adult speaking | a little girl speaking | A group discussion held entirely in English | A group discussion held entirely in Chinese | A group discussion held entirely in German |
|---|---|---|---|---|---|---|---|---|---|
| AudioGen |
|
|
|
|
|
|
|
|
|
| AudioLDM |
|
|
|
|
|
|
|
|
|
| AudioLDM 2 |
|
|
|
|
|
|
|
|
|
| Auffusion |
|
|
|
|
|
|
|
|
|
| MAGNeT |
|
|
|
|
|
|
|
|
|
| Make-An-Audio |
|
|
|
|
|
|
|
|
|
| Make-An-Audio 2 |
|
|
|
|
|
|
|
|
|
| Stable-Audio-Open |
|
|
|
|
|
|
|
|
|
| Tango |
|
|
|
|
|
|
|
|
|
| Tango 2 |
|
|
|
|
|
|
|
|
|