WRONG by safetensors

Text-to-image diffusion models are trained to understand key visual concepts with billions of tagged images. It's a similar (but much more rapid) process to how we learn visual concepts. We know what cats are, because we've seen lots of different cats and understand their common features. And if we want to draw a cat, we use our generic knowledge to create a recognisable picture of a cat, but one that is (very likely to be) slightly different from any other cat picture that we've ever seen.

If the diffusion model is trained specifically on spectrograms, however, rather than pictures of cats, fruit, people etc, then something really interesting happens. Spectrograms are visual representations of sound: plots of frequency vs time, so the machine comes to learn that certain keywords have certain spectrographic similarities. It knows what a generic smooth jazz piece 'looks like' in the same way that it knows what a generic dog looks like. You can then prompt the machine to produce new spectrograms in any style you can think of.

All you have to do then is convert these spectrograms into audio, which is possible thanks to some clever code written by other people, and you end up with some excitingly strange new music, produced by a machine that has absolutely no concept of sound, only about images.

But what if you prompt the spectrogram model for something that isn't audio? How will it interpret visual requests? This album was started - as is so often the case - by accident: a detailed image prompt entered into the wrong model. That first result was sufficiently interesting to inspire me to revisit some of my favourite previously generated images, so the track titles here reflect the prompt used to generate those images and the corresponding audio.

I was initially going to include the original images in the download, but I thought it'd be more interesting to get the listener to imagine them.

Tracklist

1.	cookery book photograph, disgusting, rotten, moldy sausages and offal and pickles, plates, bowls, technicolor, saturated colors, 1960s	1:32

2.	humanoid cybernetic robot, tribal headdress, moss, feathers, fruit, organic life forms, plant roots, leaves, blender, octane 8k render	1:32

3.	full museum exhibition of amateur metallic kinetic art, nudes, unknown objects, crystals on wooden plinths in a derelict bunker, 1980s	1:32

4.	stylised illustration of an airport lounge, monoprint, screen print, paris revolt poster, 1968 revolution style, pastel shades	1:32

5.	strange and beautiful marine life, coral, seaweed, pastel colors, jellyfish, octopus, art by Ernst Haekel, organic, ornate	1:32

6.	fantasy futuristic landscape, metropolis, sci-fi city, space port, satellites, megastructure, spaceships, dyson sphere, cyberpunk, art by Fra Angelico and Pieter Bruegel	1:32

7.	porcelain attractive young man, glazed ceramic, reflective, photorealistic, shallow depth of field, studio lighting, complementary pastel colors	1:32

8.	oil painting of a naked man, artistic pose, contortionist, one arm behind head, one arm on hip, facing away, art by Gwen John and Hans Bellmer	1:32

9.	empty hospital ward, machinery, sinister, melancholy, floating spheres, minimalist, dread, faded pastel colors, empty beds, soft lighting	1:32

10.	internal organs, nematodes, fungus, teeth, eyes, roots, wires, bones, porcelain, glass, insects, iridescent sphere, molluscs, lithograph, medical illustration	1:32

11.	crystals, fruit, rocks, coral, chemistry, physics, biology, laboratory equipment, cubes, glassware, tentacles, hand-colored, anatomical, science illustration, 1930s	1:32

12.	elaborate ceremonial mask, recycled, ribbon, bottle tops, electrical components, prisms, feathers, grass, hair, bones, sticks, flowers, beads, gemstones, pom-poms	1:32

13.	enormous junk sculpture, folk art, recycled metal, bones or pipes or sticks, wood, bottles, cans, tins, boxes, religious shrine, ceramic, concrete, foam, rubber	1:32

14.	British living room, sofa, chairs, strange furry mammal or large cocoon or huge crystal structure or large mollusc or large gelatine cuboid, 1950s	1:32

15.	domestic interior, non-euclidian space, dreamlike, sofa, sea creatures, crystals, coral, slime, lasers, wires, internal organs, machinery	1:32

16.	industrial landscape:disneyland, derelict, steelworks, chimneys\|helter-skelters, rust, concrete, oil refinery:theme park, scrapyard, billboards, rain, graffiti, bleak, gloomy	1:32

17.	blurry, creased wet plate photograph of a murder scene	1:32

18.	huge surreal complex empty playground, slides, paddling pool, climbing wall, tubes, spirals, soft play, towers, rails, pipes, merry-go-round	1:32

19.	wunderkammer filled with hair, electrical items, lab equipment, rocks, national geographic, studio lighting, kodachrome	1:32

20.	wall of framed pictures, family photos, paintings, mirrors and children's drawings, faded wallpaper, eastern European domestic interior, 1960s	1:32

21.	fashion photography portrait of a creepy Polynesian woman in a scrapyard wearing a beautiful silver and brown military outfit	1:32

22.	fashion photography portrait of weird Russian pop group in a plane crash, wearing a beautiful teal and primary colored punk outfit	1:32

23.	natural history collection, anatomical illustration, weird marine stones, glass, slime, mycology, hand-colored, 1840s	1:32

24.	large empty car park, floodlight, concrete, tarmac, fading evening light, closed shopping mall, small floating planet	1:32

25.	chaotic ethnological exhibition of found transparent global art, fruit and veg, miniature landmarks, candles, marble statues, fossils on plinths in a damaged office	1:32

26.	a large pile of randomly sized weird blue and white body parts and blankets in the middle of an abandoned classroom, fujicolor, 1970s	1:32

27.	natural history illustration, grotesque marine shrimp , vintage machinery, spider, vases, dermatology, hand-colored, 1850s	1:32

Credits

released April 26, 2023

Spectrograms by Stable Diffusion with the Riffusion 1.0 model.
Processed into audio with chavinlo's Riffusion Manipulation Tools.
Cover image by Stable Diffusion.

License