# Data Quality

Ensuring clean and consistent data is critical for reliable analysis. This category focuses on data validation, consistency checks, and cleaning.

## **Missing Data & Duplicate Detection**

### Identifying gaps and ensuring data completeness

**Purpose:** Lumi helps users ensure data quality by detecting missing values and identifying duplicates within datasets. By identifying gaps or incomplete data, businesses can prevent operational errors and maintain the integrity of their analytics.

{% hint style="info" %} <mark style="color:blue;">Use Case:</mark> An inventory manager might use Lumi AI to identify missing stock quantities in their warehouse data, ensuring that all products have accurate counts. Lumi enables users to check for missing data and duplicates, ensuring all records are complete and accurate.
{% endhint %}

> **Typical Prompts:**
>
> 1. *"Identify any products in the inventory that are missing stock quantities or SKU numbers."*
> 2. *"Find all duplicate customer records in the CRM system based on customer name and email."*
> 3. *"Check the sales orders from the last 12 months for missing shipping addresses or incomplete payment information."*
> 4. *"Which suppliers are missing contact details in the vendor database? Include the vendor ID and company name."*
>
> **Example Output:&#x20;*****Are there any actual pick up dates with null values with trip status completed?***

<figure><img src="/files/eammw9bB2IFEfoZ4n7F7" alt=""><figcaption></figcaption></figure>

## **Data Validation: Logic & Consistency Checks**

### Ensuring the data has logical consistency

**Purpose:** Lumi helps users validate data for logical consistency across datasets. This ensures that values within a dataset adhere to expected rules or relationships, such as dates being in the correct range or quantities not being negative.

{% hint style="info" %} <mark style="color:blue;">Use Case:</mark> A warehouse manager can use Lumi AI to ensure that all shipment dates are after order dates, preventing reporting errors. Similarly, inventory records showing zero stock but a positive inventory value would result in inaccurate reporting.
{% endhint %}

> **Typical Prompts:**
>
> 1. *"Find any products where the sale date is earlier than the manufacture date."*
> 2. *"Check all purchase orders to ensure that the unit price is greater than 0 for each item."*
> 3. *"*&#x41;re there any items with negative inventory?"
>
> **Example Output: Do we have any items where the inventory quantity is greater than 0 but the stock value is = 0**

<figure><img src="/files/bUQecA1UYdb9BOieLtC5" alt=""><figcaption></figcaption></figure>

> **Example Output:** **Are there any components that have itself as the BOM?**

<figure><img src="/files/J9ZinuF91r64F67B8A8t" alt=""><figcaption></figcaption></figure>


---

# Agent Instructions: Querying This Documentation

If you need additional information that is not directly available in this page, you can query the documentation dynamically by asking a question.

Perform an HTTP GET request on the current page URL with the `ask` query parameter:

```
GET https://docs.lumi-ai.com/using-lumi/lumi-use-cases/data-quality.md?ask=<question>
```

The question should be specific, self-contained, and written in natural language.
The response will contain a direct answer to the question and relevant excerpts and sources from the documentation.

Use this mechanism when the answer is not explicitly present in the current page, you need clarification or additional context, or you want to retrieve related documentation sections.
