Jayita Gulati
2024-11-04 12:00:00
BART is a sequence-to-sequence model that helps you summarize text. It can take long documents and turn them into shorter, easier-to-read versions, so you can find the main points quickly. BART works by analyzing the entire text to understand its context, then generating a summary that keeps the important parts and removes the less important ones.
With BART, you can summarize articles, reports, and other texts into a clear and concise version that focuses on the key information. Hugging Face Transformers is a library that makes using BART simple. In this article, we will show you how to set up BART and create summaries.
Why Use BART for Text Summarization?
BART is highly effective for text summarization because it can:
- Understand context: BART can read and understand long texts well. It finds the important points to make a good summary.
- Generate coherent summaries: BART makes summaries that are easy to read. It keeps the important details and removes unneeded information.
- Handle various types of text: BART can summarize many kinds of texts, like news articles, research papers, or stories. It is flexible and works well with different content.
Let’s now walk through how to use the BART model with Hugging Face Transformers to summarize texts.
Setting Up the Environment
Before using the BART model, ensure you have the necessary libraries installed. You will need the Hugging Face Transformers library, plus a deep learning backend such as PyTorch.
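A typical way to set this up is with pip; the exact command may vary with your environment, but the following installs both packages:
pip install transformers torch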
Loading the BART Model
Next, you need to set up the summarization pipeline. You can load the pre-trained BART model using the following code:
from transformers import pipeline
# Load the summarization pipeline with the BART model
summarizer = pipeline("summarization", model="facebook/bart-large-cnn")
- summarizer: A variable that stores the summarization pipeline.
- pipeline: A high-level API provided by Hugging Face for easy access to various models.
- summarization: Specifies the task to be performed, which is text summarization.
- model="facebook/bart-large-cnn": Loads a BART model that has been fine-tuned on news articles for summarization. (An alternative that loads the model and tokenizer explicitly is sketched after this list.)
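If you want more control than the pipeline provides, you can also load the model and tokenizer explicitly. This is a minimal sketch, not the only way to do it; the generation settings (such as num_beams) are illustrative values.
from transformers import BartTokenizer, BartForConditionalGeneration

# Load the tokenizer and model explicitly instead of using the pipeline
tokenizer = BartTokenizer.from_pretrained("facebook/bart-large-cnn")
model = BartForConditionalGeneration.from_pretrained("facebook/bart-large-cnn")

# Tokenize the input, generate a summary, and decode it back to text
inputs = tokenizer("Your long text goes here.", return_tensors="pt", truncation=True)
summary_ids = model.generate(**inputs, max_length=50, min_length=25, num_beams=4)
print(tokenizer.decode(summary_ids[0], skip_special_tokens=True))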
Preparing the Input Text
Next, you need to prepare the input text that you want to summarize. Behind the scenes, the pipeline breaks the text into smaller parts called tokens before feeding it to the model; you can check the token count yourself, as shown after the example below.
input_text = """
Climate change means a long-term change in temperature and weather. It can happen in one place or the whole Earth. Right now, climate change is happening in many areas. It affects nature, water, food, and health. Scientists see changes in the climate over time. Most of these changes are caused by human actions. Activities like burning fossil fuels and cutting down trees lead to climate change. These actions increase greenhouse gases in the air. Greenhouse gases hold heat in the air and make the Earth hotter. This causes global temperatures to rise.
"""
Summarizing the Text
To summarize the text, you simply pass the input_text to the summarizer pipeline.
# Generate the summary
summary = summarizer(input_text, max_length=50, min_length=25, do_sample=False)
# Output the summarized text
print(summary[0]['summary_text'])
- max_length: Defines the maximum length of the generated summary in terms of tokens.
- min_length: Sets the minimum length of the summary. This makes sure the summary is not too brief.
- do_sample=False: Ensures deterministic results by using greedy decoding instead of sampling.
This will print a shorter version of the input text.
Climate change means a long-term change in temperature and weather. Activities like burning fossil fuels and cutting down trees lead to climate change. Greenhouse gases hold heat in the air and make the Earth hotter.
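The pipeline also accepts a list of texts, so you can summarize several documents in one call. Here is a minimal sketch, where first_text and second_text are placeholder strings you define:
# Summarize multiple texts at once; the result is a list of dictionaries
texts = [first_text, second_text]
summaries = summarizer(texts, max_length=50, min_length=25, do_sample=False)
for s in summaries:
    print(s['summary_text'])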
Conclusion
Using the BART model with Hugging Face Transformers is an easy way to summarize text. You can set it up quickly and start summarizing in a few simple steps: load the pre-trained model through the summarization pipeline, pass in your text, and the model produces a shorter version of it. This saves time and helps you see the important details. Get started with BART today and make summarizing text simple and fast!
Jayita Gulati is a machine learning enthusiast and technical writer driven by her passion for building machine learning models. She holds a Master’s degree in Computer Science from the University of Liverpool.