Is ChatGPT for music being made by someone?
Is ChatGPT for music being made by someone?

Is ChatGPT for music being made by someone?

So I was thinking, could I teach chatgpt music. The problem was that I can not feed chatgpt midi files.

To do that, I figured I have to write a tool that reads binary midi files and turns them to ascii so that it understands notes. So I did that. And fed a song to chatgpt. All 8 tracks of it in form of ascii.

Then my thinking was that if I feed that to chatgpt, it would learn to do something like that. Naah. It understands simple melodies, but even then, it tends to start dreaming very fast after the initial melody. It struggles writing pieces with multiple instruments, it struggles with understanding chords.

Ie, it is not made for this purpose.

But as I was doing this, I realized, this is the way of the future. AI that can do this must be just around the corner and it has a megaton of material it can gobble in form of midi files to learn.

Now the problem will be of course the same as what picture generation ai's have. Hallucinations, being able to stay in right time signature, REALLY understanding what music IS. Verses, choruses, bridges, intros and outros.. It understand the TEXT really well, but for AI to learn how to do music. It has to be taught the LANGUAGE of music which is notations.. Ideally it should be able to read and write different daw files. Fl studio, Cubase, ableton, straight up midi and so forth. But on the top of that it should have ability to understand audio, someone singing to it.

Able to do with notes/ audio with chatGPT does with words.

I can already see a future where a composer is sitting with virtual Beethoven next to him or her. Talking about music, having him help in composing pieces. Or Drake, or 50 cent, or you get my point. Composer being helped by ai that understands music. Different styles.

But it has to be taught music first, it has to start from something first. Who is making something like this? One would think someone. I do not think llm is fit for this. The llm side works as a interface for using it, but it has to think in notes.

submitted by /u/aluode
[link] [comments]