Garbage in garbage out

Google put a box of AI-generated text at the top of their search results. If you don’t know what that means, I’m talking about the slow-loading colorful text box you’re seeing lately—it’s Google’s first step toward destroying the internet entirely (as I predicted last summer).

I have yet to meet anyone who is excited about this “feature”—at best people seem to begrudgingly accept it. But that doesn’t mean there isn’t joy to be found here: the feature is telling people to do absolutely bonkers things. Apparently Google thinks you can keep cheese from falling off pizza by mixing glue into the sauce, advice lifted from a decade-old Reddit joke. Google is also telling people that geologists recommend eating “at least a few rocks per day”, a suggestion it found on The Onion. I could go on, but others already have.

What I find interesting is how this happens. Hype aside, large language models aren’t sentient. They are machines that are trying to guess the next word to say based on statistical probability—that’s it. The fact that this ever results in coherent answers to questions is staggering. The problem is that it’s factually wrong a lot, and getting it to be wrong less often is a very hard problem.
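To make the “guess the next word” idea concrete, here is a deliberately toy sketch: a bigram model that picks the next word purely from how often it followed the previous word in some training text. Real large language models use neural networks over vastly more context, and the tiny corpus here is made up for illustration, but the core move—emit whichever continuation is statistically most likely—is the same.

```python
from collections import Counter, defaultdict

# Made-up training text, just for illustration.
corpus = "the cheese sticks to the pizza and the cheese melts".split()

# Count how often each word follows each other word.
following = defaultdict(Counter)
for prev, nxt in zip(corpus, corpus[1:]):
    following[prev][nxt] += 1

def predict_next(word):
    """Return the statistically most likely next word, or None."""
    counts = following.get(word)
    return counts.most_common(1)[0][0] if counts else None

print(predict_next("the"))  # "cheese" follows "the" twice, "pizza" once
```

Notice there is no notion of truth or humor anywhere in this process: if the training text said glue belongs on pizza, the model would happily predict exactly that.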

The glue pizza and the rock diet are both things that were posted to the internet as jokes. The machine, as it exists right now, can’t detect these sorts of jokes, so it repeats them as though they are facts. I do not know which engineer thought it would be a good idea to include Reddit and The Onion in the training data, but doing so had a predictable effect: jokes presented as facts. Garbage in, garbage out.

This has me thinking about my own mind. Now, obviously, I am not a large language model—I’m sentient. I have an understanding of myself as a person and the ability to sense whether something is a joke or not from cultural context. And when I read something, I’m not “training” my brain in the sense that a large language model is. But there’s still a level on which thinking in terms of “garbage in, garbage out” is useful.

I am, as a person, the average of the people I hang out with and the information that I take in. If I spend an evening watching horrible people saying horrible things, that is going to have an impact. If I spend time around horrible people, I’m going to be more likely to treat others poorly. If I use every bit of downtime to scroll on a website full of outrage, I’m going to feel constantly outraged.

So I try not to do that. I have excellent friends, friends I admire, friends who make me better. And I try to spend less time thinking about the outrageous and more time thinking about things in context. For me that has meant replacing Twitter with Mastodon and Reddit with the home page of various news outlets. For you it might mean something else. The point is to think critically about the kind of information you’re taking in, because on some level it will form who you are. You do not want to end up telling people to put glue on pizza.
