ChatGPT 3.5 Turbo and its Anthropic equivalent are total simps

loathsome dongeater@lemmygrad.ml · 6 months ago

ChatGPT 3.5 Turbo and its Anthropic equivalent are total simps

☆ Yσɠƚԋσʂ ☆@lemmygrad.ml · 6 months ago

It’s key to keep in mind how LLMs work. Fundamentally, they’re not really different from Markov chains. It’s a giant graph of token that the network has been trained on, and all it’s doing is predicting the next most likely token based on the input. What output it produces is directly based on what data it’s been trained on.

I’ve found a few good uses for LLMs. I find they generally do a decent job with doing text summaries, and they can also be useful for code examples. While the code they produce isn’t necessarily correct, it’s often helpful for pointing your in the right direction and faster than looking through stuff like stack overflow. Caveat there is that you already have to understand what you’re trying to do.

Another use that I’ve found interesting is to use LLM as a sounding board. If I’m trying to explore an idea, having a chat with it can be useful because the responses can often stimulate new ideas. Accuracy is not an issue in this case because I already understand the topic I’m trying to explore, and the value is in the phrasings that can fall out of the LLM that can give me a mental thread to pull on.

Similarly, LLMs are also great for practising languages. I’m learning Mandarin right now, and the app has a built in LLM chat bot that you can talk to.

I think even more interesting use cases are some of the recent stories from China where LLMs are being used for monitoring infrastructure and predicting where to do proactive maintenance. This is a great use case because these things are great at correlating large volumes of data and doing extrapolations. But the output is simply advice to the human operator who’s actually making the final decision.

KrasnaiaZvezda@lemmygrad.ml · 6 months ago

New robots are also using LLMs both for understanding their enviroment with cameras, rather than complicated sensors that might not understand the world as we do, and for controlling movement by basically taking in the data from the robot and what other LLMs understand from the enviroment and predicting what inputs are needed to move correctly for movement or doing any tasks.

As the LLMs get better they can also come up with better strategies too, which is already being used to some extent to have them create, test and fix codes based on output and error messages and this should soon allow fully autonomous robots as well that can think by themselves and interact with the world leading to many advancements, like full automation of work and scientific discoveries.

☆ Yσɠƚԋσʂ ☆@lemmygrad.ml · 6 months ago

For sure, I think LLMs might turn out to be a good way to coordinate high level action in robotics.