Kyt Dotson
2025-06-03 12:00:00
siliconangle.com
Vertesia Inc., a unified low-code platform for developing and deploying custom generative artificial intelligence applications, today announced the launch of a new semantic document preparation service it says will increase the reliability of AI applications and speed up development.
Vertesia provides a cloud-based application programming interface service that prepares underlying data for use by generative AI models, ensuring output accuracy. According to the company’s own research, up to 50% of the development time spent on generative AI applications is dedicated to document preparation.
The new semantic document preparation service is designed to ease this process and provide developers a rich context for large language models to work with, which Vertesia claims can “eliminate” generative AI hallucinations.
A hallucination is an error where an LLM generates an incorrect or false answer that it states confidently. The causes can be numerous, including training data issues, inherent limitations or challenges in understanding nuanced language or context such as incomplete or noisy data.
“The two concerns we hear most from enterprise leaders are consistent: 95% accuracy isn’t good enough and data preparation is a costly, time-consuming challenge,” said Chief Revenue Officer Chris McLaughlin. “Our Semantic DocPrep service was built to solve both — giving developers a set of APIs to automate document preparation and significantly improve the accuracy and relevancy of LLM outputs.”
The company said using its preparation service, it can convert even the most complex documents, such as reports and regulatory filings, into richly structured, semantically tagged XML. It will do this without rewriting or altering the source. Since the process preserves the original structure, relationships and context, it ensures that the LLM can understand the document without misinterpreting the information, which greatly increases the accuracy of responses.
This document transformation method is designed for developers building custom generative AI applications and retrieval-augmented generation pipelines, also known as RAG, which are used to enhance the accuracy of generative AI apps with real-time data.
The company said its data transformation engine deconstructs documents at the page level and uses the most appropriate AI model based on the content: dense text, tabular data, images or a mix. It will either use LLMs, optical character recognition or vision models. By using this hybrid model, it avoids rewrites to maintain consistency and preserve the original text and generate high-fidelity XML outputs.
The service is accessible via an API, which can be combined directly into a development pipeline. This allows developers to send documents for preparation and receive XML outputs ready for chunking, indexing and model ingestion. No setup or model training is required.
The new Semantic DocPrep is part of the company’s already existing platform, which provides infrastructure for organizations who are looking to build, deploy and manage custom generative AI applications and agents at scale.
Photo: Annie Spratt/Unsplash
Your vote of support is important to us and it helps us keep the content FREE.
One click below supports our mission to provide free, deep, and relevant content.
Join our community on YouTube
Join the community that includes more than 15,000 #CubeAlumni experts, including Amazon.com CEO Andy Jassy, Dell Technologies founder and CEO Michael Dell, Intel CEO Pat Gelsinger, and many more luminaries and experts.
THANK YOU
Enjoy the perfect blend of retro charm and modern convenience with the Udreamer Vinyl Record Player. With 9,041 ratings, a 4.3/5-star average, and 400+ units sold in the past month, this player is a fan favorite, available now for just $39.99.
The record player features built-in stereo speakers that deliver retro-style sound while also offering modern functionality. Pair it with your phone via Bluetooth to wirelessly listen to your favorite tracks. Udreamer also provides 24-hour one-on-one service for customer support, ensuring your satisfaction.
Don’t miss out—get yours today for only $39.99 at Amazon!
Help Power Techcratic’s Future – Scan To Support
If Techcratic’s content and insights have helped you, consider giving back by supporting the platform with crypto. Every contribution makes a difference, whether it’s for high-quality content, server maintenance, or future updates. Techcratic is constantly evolving, and your support helps drive that progress.
As a solo operator who wears all the hats, creating content, managing the tech, and running the site, your support allows me to stay focused on delivering valuable resources. Your support keeps everything running smoothly and enables me to continue creating the content you love. I’m deeply grateful for your support, it truly means the world to me! Thank you!
BITCOIN bc1qlszw7elx2qahjwvaryh0tkgg8y68enw30gpvge Scan the QR code with your crypto wallet app |
DOGECOIN D64GwvvYQxFXYyan3oQCrmWfidf6T3JpBA Scan the QR code with your crypto wallet app |
ETHEREUM 0xe9BC980DF3d985730dA827996B43E4A62CCBAA7a Scan the QR code with your crypto wallet app |
Please read the Privacy and Security Disclaimer on how Techcratic handles your support.
Disclaimer: As an Amazon Associate, Techcratic may earn from qualifying purchases.