For now, there are ways to figure out what’s computer generated, in this article, we will be discussing about How to Detect AI-Generated Text — Here’s Why That’s a Bad Thing.
- New research shows that people learn to spot machine-generated text.
- Knowing what’s written by AI is getting more urgent with the proliferation of software like ChatGPT.
- But some experts say that recognizing AI-generated text will become impossible as AI improves.
It’s getting harder and harder to tell the difference between sentences generated by artificial intelligence (AI) and people.
The good news is that a new paper shows that people can learn to spot the difference between AI-generated text and human-written text. The ability to determine if computers craft online information is becoming critical with the rise of large-scale language models such as ChatGPT, which many predict will touch virtually every corner of our lives.
Ways to use AI-generated content
AI-generated content is best used as a writing assistant instead of relying strictly on technology. Here are some ways to use AI tools for assistance with content:
- Research. For writers having issues organizing a topic or coming up with ideas, AI-generated text can help them get started. Some tools give ideas about what to include for broader topics to help narrow down the research process.
- Overcome writer’s block. For writers who know their keyword or topic, AI-generated content tools can help them get started by offering a few hundred words on the subject. Some tools recommend headers so writers can get moving and adapt their content.
- Proofread current material. To make sure a drafted article is optimized, writers can run it through AI tools for a grade. The tool can also highlight keywords and phrases that should be used. AI tools can also assist with checking grammar and correcting spelling mistakes.
- Write short content. AI tools can produce a lot of content in a short amount of time, so they are a great way to reduce boredom with repetitive tasks. While some communications require more of an emotional side, some short descriptions do not. Product descriptions, metatags, ad copy and social media posts are examples of short text for content generators.
- Translate language. For written material to appeal to all audiences, AI generators can help translate content into different languages.
- Create templates. AI tools can help create emails or other templates. Some AI tools offer different types of ready-made templates for people to plug in customized information.
Telling the Difference Between Real and ChatGPT
Researchers from the University of Pennsylvania wrote their paper by examining data collected using “Real or Fake Text?”, a web-based training game. Participants are asked to indicate whether a machine has produced a given text in a yes-or-no fashion. This task involves classifying a text as real or fake and scoring responses as correct or incorrect. The study found that participants scored significantly better than random chance.
“Our method not only gamifies the task, making it more engaging, it also provides a more realistic context for training,. “Generated texts, like those produced by ChatGPT, begin with human-provided prompts.”
At moments when real human connection matters, we’d like to hear from real people, not machines.
Toyama said there are many reasons why learning to distinguish between computer-generated text and human writing is important. For example, teachers would like to know whether students are submitting essays they wrote themselves or text written partly or entirely by computer. He noted that recently, Vanderbilt University sent an email written by ChatGPT to its community linking the tragedy of the Michigan State shooting to campus efforts toward inclusion. There was a swift backlash.
“There are plenty of other such situations,” he added. “In fact, I believe that one essential form of regulation of AI is that the law should require any text, image, audio, video, or other creative output generated by a computer to be clearly marked as such.”
Spotting AI-Generated Text
Toyama is pessimistic that people can consistently spot the difference between AI-generated text and human-generated text, saying, “in the long term, it will become virtually impossible because the AI will become better and better.” He pointed to informal experiments that suggest that even experienced teachers have difficulty distinguishing student writing from ChatGPT.
“I have a colleague who claims that he spotted two instances of ChatGPT submissions because the students’ writing had suddenly dramatically improved—but, ChatGPT can be directed to write at different levels of ability or to introduce errors,” he added.
Computers might be used to identify if a particular text is human-made. Parag Arora, the CEO of Kwegg, an AI content system, told Lifewire in an email interview that research is underway to develop programs that spot machine-generated writing.
There is a larger concern with AI-generated content – students using this tool to cheat. Several professors and teachers stated that the AI-generated text on ChatGPT created convincing essays.
To help distinguish between human and AI text, OpenAI announced a new AI text classifier on Jan. 31, 2023. There are other online tools to help detect AI-generated text by classifying how likely it was written by a person versus AI, such as Originality.ai, Writing.com or Copyleaks.
OpenAI is also working on creating a watermark for longer AI-generated text for an immediate identifier.
In addition to running text on an online AI-content detector, there are a few other signs that content is AI-generated, such as:
- Lack of typos. Human writing often has typos and contains slang terms. AI-generated text rarely has typos such as grammar mistakes or misspellings.
- Overuse of “the.” AI-generated text is based on predictive language by determining which word should be next, so it commonly uses words such as “the,” “it” or “is” instead of more unusual terms.
- No cited sources. AI-generated text will often give facts and not cite the sources.
- Shorter sentences. AI content sentence length is typically shorter because it is trying to mimic human writing but appears more choppy than human writing.
- Repetition of words or phrases. If certain words or phrases are used abundantly, this could be a sign of AI-generated content because it is trying to fill space with relevant keywords.
- Lack of analysis. AI can collect data but cannot analyze the data. If an article feels full of facts without any insight, it might be AI-generated.
“AI classifiers being launched by research organizations, including OpenAI itself, look promising,” he added. “However, GPT advances seem to be always a step ahead of them, and by nature of research, classifiers are always at least one generation behind generators. Some work has been done on leaving cryptic signatures as a policy for every GPT generator, which can be a sure-shot step toward solving this problem.”
There are some ways for users to spot machine-generated text, Robert Brandl, the CEO of Tooltester, said in an email interview. He suggests trying out AI tools like ChatGPT to see what kind of content they generate to get more familiar with them. Among the giveaways are formulaic structures where sentences start with “Firstly” and “Secondly,” etc., and a lack of sarcasm or humor in the text.
Example of AI-generated content
There are many AI content generators available. Here is an example of how it works using a program called CopyAI:
- Select style of writing from the left side.
- Answer topic.
- Add main points.
- Select tone.
A couple options pop up for text. Here is the AI-generated content from the information entered. This was the first option meant to imitate the information in this article:
AI is an amazing technology that has the potential to change our world and how we live in it. But like any new technology, there are pros and cons to AI-generated content—and businesses need to be aware of both before deciding whether or not to use it.
First, the pros:
1) AI-generated content saves time and money for businesses by allowing them to produce more content with fewer resources.
2) It allows businesses to create highly personalized content for each customer, which leads to happier customers.
3) AI-generated content makes sure that all information is correct, which helps prevent expensive lawsuits or bad press if something goes wrong with your business’s operations (e.g., if you send out an email with incorrect information).
Now, let’s talk about some of the cons:
1) AI-generated content can sound unnatural (like it was written by a robot), which can make customers feel uncomfortable or distrustful of your business. It doesn’t matter how much effort you put into making sure that everything is correct if the end result sounds robotic!
2) If someone finds out that they’ve been reading an AI-generated piece of content instead.
Much of the information included in this TechTarget article is outlined in the AI-generated content above. However, the AI content is not particularly comprehensive or conversational — even when selecting a friendly tone. This TechTarget article goes into greater detail for explanations and provides examples. The AI-generated content seems to be more of an outline and needs additional information to flow as a human-written article.
“As AI develops, however, the tools that are designed to spot AI text will learn too,” he added. “Indeed, it may get to a point where people need to rely on these tools to understand when AI is being used, that is, if publishers don’t openly disclose AI usage anyway.”
Comments