5 Misconceptions About Data and AI Projects

This article by Seth Earley was originally published on MDM.COM.

Machine learning and AI programs run on data. The quality and reliability of that data are critical ingredients in your formula for leveraging AI. The old “garbage in/garbage out” saying still applies no matter how advanced the algorithm.

Many misconceptions about AI have hurt the success of AI projects. Having the correct “training data” is critical to a positive outcome. Many projects go over budget or miss their deadlines because teams underestimate the time needed to train the algorithm or cannot access the correct data.

Here are five misconceptions about data and AI projects:

1. The AI will fix the data

At the height of AI hype, many vendors of AI technology claimed that their algorithms could ingest incomplete or poor-quality data and were smart enough to find patterns and make predictions anyway. This is simply not the case. It is true that some algorithms can help with data quality, but those use cases are highly specific and still require the right “reference data” that the system can use to train and to find or correct issues with operational data.
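As a rough illustration of how reference data supports data cleanup, the sketch below corrects a field in operational records against a curated list of known values and sets aside anything it cannot match. The reference table, the `state` field, and the function names are all hypothetical and exist only for this example; a real project would draw its reference data from a governed source.

```python
# Hypothetical reference data: canonical state codes and their known variants.
REFERENCE_STATES = {
    "MA": {"ma", "mass", "massachusetts"},
    "NY": {"ny", "new york"},
}


def build_lookup(reference):
    """Map every known variant (lowercased) to its canonical code."""
    lookup = {}
    for canonical, variants in reference.items():
        for variant in variants:
            lookup[variant] = canonical
    return lookup


def clean_records(records, reference=REFERENCE_STATES):
    """Return (corrected records, records that could not be matched)."""
    lookup = build_lookup(reference)
    corrected, unresolved = [], []
    for record in records:
        # Normalize the raw value; a missing field is treated as empty.
        key = (record.get("state") or "").strip().lower()
        if key in lookup:
            corrected.append({**record, "state": lookup[key]})
        else:
            unresolved.append(record)  # left for human review
    return corrected, unresolved
```

Without the reference table there is nothing to correct against, and the unresolved records still need a person to look at them.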

2. Point the AI to “all of the data” and it will find the correct solution

Context is as important for AI as it is for people. Just as people need to orient themselves when looking for answers (you don’t look for iPhone solutions in a car repair manual), the data source for AI requires curation and context. If we are building a question-answering system for a consumer, it does not make sense to ingest complex engineering documents. When IBM trained Watson to play Jeopardy!, ingesting some data sources reduced performance. More data was not necessarily helpful. The program required carefully selected data.
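One simple form of curation is to tag each source with its intended audience and ingest only the matching ones. The sketch below assumes such tags already exist; the `Document` class, the `audience` field, and the tag values are hypothetical names chosen for illustration, not part of any particular product.

```python
from dataclasses import dataclass


@dataclass
class Document:
    title: str
    audience: str  # hypothetical tag, e.g. "consumer" or "engineering"
    text: str


def curate(documents, audience):
    """Keep only the documents tagged for the intended audience."""
    return [doc for doc in documents if doc.audience == audience]


sources = [
    Document("Resetting your phone", "consumer", "Hold the power button..."),
    Document("Baseband firmware spec", "engineering", "Register map..."),
]

# Only the consumer-facing document would be passed on for ingestion.
consumer_sources = curate(sources, "consumer")
```

Assigning those tags is the real work, and it depends on people who understand both the content and the questions users will ask.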

3. Cognitive AI (chatbots and intelligent virtual assistants) can be deployed out of the box

There are some very limited use cases where a chatbot can be turned on out of the box. However, chatbots and IVAs need the same training that a human needs. You would never drop a new hire into a support role without training. The AI needs the same. Any meaningful functionality will be powered by your knowledge and data sources, and those sources require the correct format and structure to be retrieved by a cognitive assistant. Chatbots are a channel to knowledge, content and information.

4. AI data issues can be solved by IT

In many projects, IT is left to address data problems that arise from business processes and business decisions. Imagine that salespeople will not enter data into a CRM system. That is not something IT can solve, since it is a business process issue. Nor can the problem simply be outsourced to a low-cost offshore provider. Data needs to be owned by the business and support business goals. IT is the enabler but cannot own business data.

5. AI will eliminate the need for data governance

Data governance is more important than ever. What data is owned by the organization? What can be done with it? What are the data sources, and how is the data being consumed or translated by other systems and processes? How well is data being leveraged to produce value for the enterprise and the customer? How can data issues be addressed and remediated? The data infrastructure of the organization is essential. Investments need to be prioritized and results measured. Strong data governance helps get the organization’s data house in order.

The future belongs to organizations that can best merge their processes, business value and customer relationships with advanced AI capabilities. Data is critical and in fact is more important than the algorithm. Getting your data house in order needs to be a priority, with board-level attention and funding commensurate with the scale of the enterprise and its data challenges. That will be a formula for success.

Seth Earley
Seth Earley is the Founder & CEO of Earley Information Science and the author of the award-winning book The AI-Powered Enterprise: Harness the Power of Ontologies to Make Your Business Smarter, Faster, and More Profitable. He is an expert with 20+ years of experience in Knowledge Strategy, Data and Information Architecture, Search-based Applications and Information Findability solutions. He has worked with a diverse roster of Fortune 1000 companies, helping them achieve higher levels of operating performance.
