
AIs That Can Draw - It is Still About the Data

Creativity is considered to be a bastion of humanness and somewhat outside the realm of artificial intelligence. But AI can be used to generate variations of artistic themes that appear to be its own creations.

But I would say this computer-based creativity is a reflection of the creativity of the programmers who are building the algorithms that simulate this most human trait.

OpenAI has created a program called DALL-E 2 that can create or change images based on textual descriptions. The striking capabilities of this technology have vast implications for creativity as part of the field of synthetic media. Deep fakes - AI-generated images, video and audio based on source files - can show people saying and doing things that never happened. DALL-E 2 adds a very interesting capability: it interprets textual descriptions and combines concepts with an artistic style, or adds visual elements that are consistent with the subject's style and blend in natural effects such as lighting and shadows. The result is amazing.

The key to these capabilities is having the correct training data (no surprise) - images labeled with concepts that tell the algorithm the characteristics of, for example, a cat. One image was of a “cute cat” (one could argue that most cats are pretty cute, if you like cats), so the program had to be trained on cute cats. But the question I have is whether this is “post-coordinated” or “pre-coordinated”: was the training on “cats” and separately on “cuteness” (post-coordinated, just as ecommerce sites use separate facets to filter products), or on “cute cats” (pre-coordinated, that is, combined into a single concept)? My guess is the latter due to the subjectivity of what “cute” is.

Image credit: https://daleonai.com/dalle-5-mins
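The difference between the two labeling approaches can be sketched in a few lines of Python. This is a toy illustration only, not how DALL-E 2 is trained: the image IDs, the labels, and the two search functions are all hypothetical, made up here to show how facet-style (post-coordinated) labels are combined at query time while pre-coordinated labels must already exist as a single concept.

```python
# Hypothetical toy data: image IDs and labels are invented for illustration.

# Post-coordinated: each image carries independent facets.
POST_COORDINATED = {
    "img_001": {"cat", "cute"},
    "img_002": {"cat", "grumpy"},
    "img_003": {"dog", "cute"},
    "img_004": {"lollipop", "cute", "creepy"},  # labelers disagreed, so both facets are kept
}

# Pre-coordinated: each image carries a single combined concept.
PRE_COORDINATED = {
    "img_001": {"cute cat"},
    "img_002": {"grumpy cat"},
    "img_003": {"cute dog"},
    "img_004": {"lollipop"},
}


def post_coordinated_search(index, *facets):
    """Return IDs of images that carry every requested facet."""
    wanted = set(facets)
    return sorted(image_id for image_id, labels in index.items() if wanted <= labels)


def pre_coordinated_search(index, concept):
    """Return IDs of images labeled with exactly this combined concept."""
    return sorted(image_id for image_id, labels in index.items() if concept in labels)


if __name__ == "__main__":
    # Facets are combined at query time.
    print(post_coordinated_search(POST_COORDINATED, "cute", "cat"))
    # A single facet can also be queried on its own.
    print(post_coordinated_search(POST_COORDINATED, "cute"))
    # The combined concept only matches if someone labeled it that way.
    print(pre_coordinated_search(PRE_COORDINATED, "cute cat"))
    print(pre_coordinated_search(PRE_COORDINATED, "cute"))
```

In this sketch the faceted index can answer "cute" on its own or in combination with any subject, but it treats "cute" as meaning the same thing for a cat and a lollipop. The combined labels avoid that assumption at the cost of needing a separate label for every combination.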

When working on a digital asset management project a couple of years back, we had to define ambiguous and subjective attributes. One image of lollipops with faces on them was interpreted as cute by some and creepy by others.

Images are notoriously difficult to describe in text. But consider the labels as handles on existing images rather than descriptions of the image. Therefore the training data - the images representing concepts - needs to be carefully chosen to define the inputs to DALL-E. This is generally true of any AI technology. In many cases the data is more important than the algorithm. Humans select training data and ultimately humans have to label that data.

The application of algorithms to creative endeavors allows humans to use judgment in selecting and evaluating various AI-generated outputs. This is a great example of augmentation of distinctly human abilities that can improve human creativity and productivity. These applications depend on the right training data, correctly labeled, just as in every application of AI.

Here's an explanation of the CLIP model that DALL-E 2 uses to connect text and images:

 

A gallery of some DALL-E 2 generated art:

https://www.instagram.com/twominutepapers/

Another highly geeky deep dive into OpenAI's paper on DALL-E 2:

 

 

Seth Earley
Seth Earley is the Founder & CEO of Earley Information Science and the author of the award-winning book The AI-Powered Enterprise: Harness the Power of Ontologies to Make Your Business Smarter, Faster, and More Profitable. He is an expert with 20+ years of experience in Knowledge Strategy, Data and Information Architecture, Search-based Applications and Information Findability solutions. He has worked with a diverse roster of Fortune 1000 companies, helping them achieve higher levels of operating performance.
