Melodia is still being trained, so many of these tips will become unnecessary in the coming weeks.
Tip 0: If Melodia mispronounces a word, try spelling it differently.
Tip 1: Add structure markers (or tell the auto-prompt box to include them), such as [Build] and [Drop] for EDM-type songs, in addition to [Chorus] and [Verse]. Using [Verse 1] causes Melodia to start from the beginning of the song. If you want to "skip to the good part", just start the lyrics without a tag (and maybe without a capital letter for the first word). Make sure to use [square brackets] and not (parentheses).
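If you paste lyrics from elsewhere, the section markers are often in parentheses. Here is a rough Python sketch that rewrites them with square brackets before you submit. The function name and the list of section names are made up for illustration (the names are just the ones mentioned in Tip 1), and this is not part of Melodia itself.

```python
import re

# Section names taken from Tip 1; the full set Melodia recognizes is an assumption.
SECTION_NAMES = ("Verse", "Chorus", "Build", "Drop")

# Matches a line that is only a parenthesized marker, e.g. "(Chorus)" or "(Verse 2)".
_PAREN_TAG = re.compile(
    r"^\s*\(\s*((?:" + "|".join(SECTION_NAMES) + r")(?:\s+\d+)?)\s*\)\s*$",
    re.IGNORECASE,
)

def fix_section_tags(lyrics: str) -> str:
    """Rewrite lines like "(Chorus)" as "[Chorus]"; leave all other lines alone."""
    fixed = []
    for line in lyrics.splitlines():
        match = _PAREN_TAG.match(line)
        fixed.append(f"[{match.group(1)}]" if match else line)
    return "\n".join(fixed)

print(fix_section_tags("(Chorus)\nla la (la)\n(Verse 2)"))
```

Parentheses inside a lyric line are left untouched; only lines that consist of nothing but a marker are changed.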
Tip 2: Use the auto-prompts for descriptions; they're usually good.
Tip 3: Melodia usually doesn’t use the voices of the listed artists, but names in prompts do have an important stylistic influence. It can create similar voices to some artists (like Frank Sinatra, Ella Fitzgerald, Johnny Cash, and Jason Derulo). Ping zaptrem on Discord with others you find!
Tip 4: To make the lyrics sound more like an artist, try adding their name to the verse/chorus tags (e.g., [Chorus: Taylor Swift]). Note that this doesn’t always work and may cause the model to say the name out loud instead.
Tip 5: Have a set of lyrics but can’t think of a style? Try Advanced mode with NO description prompt (or just a period)! Melodia will produce some interesting results.
Tip 6: The style and content of lyrics can sometimes have a significant effect on the output song style. Check the lyrics for your target style to see if they tend to phrase things in a certain way. For example, in God’s Plan by Drake:
“I been movin' calm, don't start no trouble with me / Tryna keep it peaceful”
Notice the lyrics say “movin’” instead of moving, “Tryna” instead of trying to, and “don’t start no trouble with me” instead of don’t start trouble with me.
Tip 7: Capitalization of lyrics can cause them to be placed in the next verse.
Tip 8: Prompt Boost: For people who are familiar with diffusion models, this cranks up the classifier-free guidance scale. It forces the model to pay more attention to the prompt at the expense of song quality. I suggest not using it and am considering removing it.
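For anyone curious what "cranking up" the guidance scale means, here is a minimal sketch of the standard classifier-free guidance combination, written with plain Python lists. The function and argument names are made up for illustration, and how Melodia actually wires this in is not documented here, so treat it as the textbook formula rather than Melodia's code.

```python
def apply_cfg(uncond, cond, guidance_scale):
    """Standard classifier-free guidance: push the prediction away from the
    unconditional output and toward the prompt-conditioned one.

    uncond: model prediction without the prompt (list of floats)
    cond: model prediction with the prompt (list of floats)
    guidance_scale: 1.0 means plain conditional output; larger values
        weight the prompt more heavily.
    """
    return [u + guidance_scale * (c - u) for u, c in zip(uncond, cond)]

# Toy numbers only, not real model outputs.
print(apply_cfg([0.0, 1.0], [1.0, 3.0], 1.0))  # same as the conditional prediction
print(apply_cfg([0.0, 1.0], [1.0, 3.0], 2.0))  # extrapolated past it
```

With a scale above 1.0 the result is extrapolated beyond what the model predicted for the prompt, which is why high values tend to follow the prompt more closely while hurting overall quality.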
Examples of good Advanced mode descriptions (try the first one!):