Data Quality
Last updated
Was this helpful?
Last updated
Was this helpful?
Ensuring clean and consistent data is critical for reliable analysis. This category focuses on data validation, consistency checks, and cleaning.
Purpose: Lumi helps users ensure data quality by detecting missing values and identifying duplicates within datasets. By identifying gaps or incomplete data, businesses can prevent operational errors and maintain the integrity of their analytics.
Typical Prompts:
"Identify any products in the inventory that are missing stock quantities or SKU numbers."
"Find all duplicate customer records in the CRM system based on customer name and email."
"Check the sales orders from the last 12 months for missing shipping addresses or incomplete payment information."
"Which suppliers are missing contact details in the vendor database? Include the vendor ID and company name."
Example Output: Are there any actual pick up dates with null values with trip status completed?
Purpose: Lumi helps users validate data for logical consistency across datasets. This ensures that values within a dataset adhere to expected rules or relationships, such as dates being in the correct range or quantities not being negative.
Typical Prompts:
"Find any products where the sale date is earlier than the manufacture date."
"Check all purchase orders to ensure that the unit price is greater than 0 for each item."
"Are there any items with negative inventory?"
Example Output: Do we have any items where the inventory quantity is greater than 0 but the stock value is = 0
Example Output: Are there any components that have itself as the BOM?