From article to a headline: How to create headlines with AI?

Frederik Simon Bäumer

Last modified: 14.03.2023 5 minutes read

60e7f63994ab35f129cfd3af__4ec16d3a-6257-49a0-a785-7bbb7c5852b1.jpg

This article shows how authors can use GPT-3 to generate headlines based on article abstracts while keeping the magazine's writing style.

Headlines are one of the most critical aspects of an article. They draw readers in and give them a sense of what the article is about. A good headline will make an article more likely to be read, while a bad headline will discourage people from reading it. Headlines should be clear and concise and should accurately reflect the article's content. They should also be exciting and attention-grabbing so that readers will want to click on them. Spending time crafting the perfect headline can be worth it, as it can mean the difference between an article being read or ignored.

But coming up with a good headline can be challenging. Authors want something that accurately reflects the content of the article, but also want something that will grab attention and make people want to read more. This is where AI can help. Several AI-powered tools can help create better headlines for articles. These tools use algorithms to analyze the content and come up with accurate and attention-grabbing headlines. But, no surprise, in this article, all we need is GPT-3.

What are we going to look at?

Let's look at the basic title generation procedure with GPT-3. Based on this, we will collect some sample data from the magazine "Stern" to imitate the style of the magazine:

Headline generation (Zero-shot)
Data acquisition with AI support
Headline generation (Few-shot)

Headline generation (Zero-shot)

As explained in the previous article, Zero-shot means to avoid giving GPT-3 more examples for what is expected as a result. The only information provided for generation is the task description in natural language. In this case, we take the abstract of a "Stern" article titled "Schädlicher Whatsapp-Kettenbrief verspricht Milka-Gewinnspiel – so erkennen Sie Betrugsversuche" and tell GPT-3 to create a German headline (green texts parts are generated by GPT-3).

Not a bad headline. We can repeat the generation as many times as we want, and, depending on the parameterization, we will get more outputs. Here is a selection:

Vorsicht bei angeblichem Milka-Gewinnspiel!
Mondelez warnt vor angeblichem Milka-Gewinnspiel
Mondelez warnt vor Fälschung eines angeblichen Milka-Gewinnspiels

In the following, let's see if we can push the headings a bit more into the "Stern" style.

Data acquisition with AI support

The generated headlines already read well. However, they are not yet in style we know from "Stern". Therefore, we want to give GPT-3 a few examples in the following, which AI will consider during the generation. This procedure is called Few-shot. However, since we don't have any titles available, we must collect some. And in a good programmer mentality, we automate what can be automated.

However, acquiring data is not an exciting task from a programmer's point of view. Especially when it comes to collecting freely available data without additional security measures, these are precisely the titles of the articles on the "Stern" website. An RSS feed is offered, which can be accessed without much effort. We take the RSS feed from the "Panorama" section.

Therefore, the programming task is so simple that one can also let GPT-3 do it. Consequently, we ask GPT-3 if it can write the corresponding source code for a Python script. This also happens as instruction in natural language (The video shows an example of how to access the titles).

Now we have our examples that we can use in the following. Here are some examples for the titles:

"Cold Case": Mordserie nach mehr als 30 Jahren aufgeklärt – DNA-Spuren überführen "I-65-Killer"
Ukraine-Krieg: Nach Rückzug russischer Truppen: Mehr als 400 Menschen in Hostomel vermisst
Blick in die Vergangenheit: 13,5 Milliarden Lichtjahre entfernt: Forscher entdecken offenbar die entfernteste Galaxie, die je beobachtet wurde
Video: München: Stimmungsbild nach Ablehnung der Impfpflicht ab 60

... Okay, we learn, in "Stern" colons are taken very seriously.

Headline generation (Few-shot)

We give the collected examples to the language model in a semi-structured format. Here we use the prefixes "title" and "abstract" to signal to GPT-3 what we are entering and separate the respective examples with a line break. For the abstract for which we want a title generated, we enter only the abstract (with prefix) and start the following line with "title:" so that GPT-3 knows we want a completion here. We thus give GPT-3 20 examples, resulting in a total request of 2,263 tokens, which will cost approximately $0.15 to process.

To be honest, I am not an expert on headlines, but it is already recognized that the characteristic ":" is used in the title, and hyphens are used. And again, we can repeat the generation as many times as we want, and, depending on the parameterization, we will get more outputs. Here is a selection:

Milka-Gewinnspiel: Achtung, Fälschung!
Whatsapp-Kettenbrief: "Gewinne einen Oster-Geschenkkorb voller Schokolade" - Mondelez warnt: Fälschung!
Milka-Gewinnspiel: Schokoladen-Kettenbrief auf Whatsapp ist eine Fälschung
Whatsapp-Kettenbrief: Mondelez warnt vor Fälschung
Whatsapp-Kettenbrief: Milka warnt vor angeblichem Gewinnspiel - Schokolade gibt es nicht zu gewinnen

Let's summarize.

Creating article headlines is a creative and challenging task. They should be meaningful, not too short, not too long, contain everything necessary and be catchy. AI can help to create variants and thus fuel the creative process. GPT-3 allows you to use examples to determine how titles should look. The example shown here is limited to the maximum request length to GPT-3. In production, one would use fine-tuning here, in which hundreds to thousands of examples are used to “tune” the text generation. We will talk about that soon.

That’s it for now.

The opinions and information stated in this article are personal to the individual author and do not necessarily represent Bertelsmann.

Frederik Simon BäumerData ScientistArvato

#nlp#artificial-intelligence#gpt3#X-DIA

Newest job offers

Working Student Analytics & AI Engineer

Berlin, BE, DE, 10777

Data Analytics

View job offer

SAP Architect

Kuala Lumpur, 14, MY, 50400

SAP Consulting / Development

View job offer

Data Platform Engineer (Fixed-Term Contract to April 2026)

London, GBN, GB, SW1V 2SA

Data Engineering

View job offer

Data Visualisation Scientist

London, LND, GB, SW11 7BW

Data Science

View job offer

Senior Data Engineer

Kuala Lumpur, 14, MY, 50400

Artificial Intelligence / Machine Learning

View job offer

SAP basis engineer

Brasov, BV, RO, 500446

Networks and Infrastructure

View job offer

Engineer, Infrastructure (Kubernetes/AWS) (Open to remote)

New York, NY, US, 10019

Networks and Infrastructure

View job offer

Senior PHP Developer

Kuala Lumpur, 14, MY, 50400

Web Development

View job offer

Create Your Own Career

On our career website "Create Your Own Career" you can discover the wide range of entry and career opportunities at Bertelsmann and be inspired by our employee stories!

Find more interesting jobs

From article to a headline: How to create headlines with AI?

What are we going to look at?

Headline generation (Zero-shot)

Data acquisition with AI support

Headline generation (Few-shot)

Let's summarize.

About the Author

Tags

Share Article

Newest job offers

Working Student Analytics & AI Engineer

SAP Architect

Data Platform Engineer (Fixed-Term Contract to April 2026)

Data Visualisation Scientist

Senior Data Engineer

SAP basis engineer

Engineer, Infrastructure (Kubernetes/AWS) (Open to remote)

Senior PHP Developer

Create Your Own Career