It’s not just that the input data is crap. Mostly the issue is that an LLM is a glorified autocomplete. The core of the technology is making grammatically correct sentences. It has no concept of facts or logic. Any impression that it does is just an illusion borne of the word probabilities baked in.
LLMs are a remarkable example of brute-forcing a solution to a problem, but it’s this same brute force that makes me doubt it’ll ever reach the next level.
Get a Nintendo console. Nintendo Games are absolutely a gateway drug. They’re high quality and generally very easy to get into.