Ask AI

Data Analyst Agent: Best Practices & How to Use It

1. What Can You Build with the Data Analyst Agent?

The Sapience Data Analyst Agent helps users analyze structured data files such as CSV and Excel without requiring technical or coding skills.

It is designed to support:

  • Data understanding and validation
  • Performance analysis and benchmarking
  • Clear visualizations and summaries
  • Repeatable, explainable analysis suitable for business and enterprise use

This guide explains how to use the agent step by step, from first interaction to advanced analysis.

Notion image
Notion image
 
Notion image
 
Notion image

1.1 Start with a Conversation

Begin by opening a chat with the Data Analyst Agent. Use this stage to:

  • Understand what the agent can and cannot do
  • Clarify your objective (e.g. comparison, performance review, benchmarking)
  • Confirm whether your data is suitable for analysis

This step helps avoid unnecessary rework later.

Notion image
 

1.2 Upload Your Data File

Upload a structured dataset, preferably:

  • CSV

Best practice:

  • One table per file
  • Clear column headers
  • No merged cells or complex formatting
 

Once uploaded, the agent prepares the data for inspection.

Notion image
 

1.3 Data Profiling

Data profiling is an initial review of your dataset to understand:

  • What data exists
  • How it is structured
  • Whether there are missing or inconsistent values

This step ensures that all further analysis is based on accurate assumptions.

The agent will:

  • Identify rows and columns
  • Detect data types (numeric, text, dates)
  • Highlight missing or empty values
  • Generate a data dictionary explaining each field
  • Summarize distributions and basic statistics
 
Notion image
 

1.4 Outputs You Receive

The agent provides downloadable outputs, including:

  • Interactive HTML profile report
  • PNG images of distributions
  • CSV summary tables
  • JSON profile (for reference or reuse)

These outputs are optional to download but always available.

Notion image
 

1.5 Handling Missing Data

Missing values are common in real datasets.

By default, the agent:

  • Flags missing values clearly
  • Uses available (non-missing) values for calculations
  • Explains any assumptions made

You can always request:

  • Alternative handling of missing data
  • Clarification on how missing values affect results

Example:

“Explain how missing data impacted this analysis.”
Notion image
 
 

2. Walkthrough: Creating Your First Visual

Descriptive analysis answers the question: “What does the data show?”

You can request:

  • Overall score summaries
  • Criterion-level breakdowns
  • Direction, Execution, and Results analysis
  • Trends across years or groups
 

2.1 What You’ll Receive

  • A written summary of findings
  • Key insights explained in plain language
  • Charts and tables to support conclusions
  • Suggested next steps for further analysis

If unsure what to analyze next, follow the Suggested Areas of Further Exploration provided by the agent.

 
Notion image
 
Notion image
 

2.2 Organization-Level Analytics

Use this step to compare one organization against others.

Common use cases:

  • Ranking against top-performing organizations
  • Comparison with peer groups (e.g. scores above a threshold)
  • Country, sector, or regional benchmarking
 

The agent highlights:

  • Relative position
  • Performance gaps
  • Clusters and outliers

All comparisons are explained in simple terms.

Notion image
 
Notion image
 
Notion image
 
Notion image
 

2.3 Criterion-Level Analysis

Criterion-level analysis focuses on performance by dimension or framework criteria.

The agent can:

  • Compare scores across criteria
  • Identify strongest and weakest areas
  • Benchmark against peer averages
  • Visualize results using bar and radar charts

This step is useful for identifying areas of strength and improvement.

Notion image
 
Notion image

Notion image

Notion image

2.4 Approach-Level Analysis

(Direction, Execution, Results)

This analysis focuses on how performance is achieved, not just outcomes.

You can:

  • Compare Direction, Execution, and Results internally
  • Benchmark against peers, sectors, or countries
  • Identify alignment gaps between strategy and outcomes

Results are explained clearly, without technical language.

Notion image

Notion image
 
Notion image
 

2.5 Advanced Analytics

Advanced analytics help answer deeper questions, such as:

  • What factors contribute most to strong performance?
  • Which dimensions have the greatest impact on results?

These analyses may take longer to run due to complexity.

Best practice:

  • Break complex questions into smaller requests
  • Run advanced analysis only after completing earlier steps
 
Notion image
Notion image

Notion image
 
Notion image
 
Notion image
 
Notion image
 

2.6 Generate a Full Analysis Report

You can request a comprehensive report that:

  • Summarizes all findings
  • Includes key visuals and interpretations
  • Is suitable for leadership review or documentation

Example request:

“Create a full analysis report summarizing all key findings from this dataset.”
Notion image
 

3. References: Common Visualization Types

Choosing the right chart matters more than most people think.

The wrong chart can confuse, distract, or even mislead.

The right one makes insights obvious in seconds.

Below are the most common visualization types used by the Data Analyst Agent, explained in human terms, what they show, when to use them, and what to avoid.

 

3.1 Histogram

 
Notion image
 

What it shows

A histogram shows how values are distributed across ranges (or “buckets”).

Instead of comparing categories, it answers: How often does something happen?

 

Plain-English example

“How many organizations scored between 60–70, 70–80, 80–90?”

 

Best used when

  • You want to understand spread, variation, or patterns
  • You’re checking if results cluster around certain values
  • You’re looking for skewed data or unusual peaks
 

Best practices

  • Use consistent bin sizes
  • Label axes clearly (score ranges, frequency)
  • Pair with a short explanation (“Most scores cluster around…”)
 

Avoid when

  • You want to compare named categories (use bar charts instead)
  • You have very few data points
 

3.2 3D Scatter Plot

Notion image
 
 

What it shows

Relationships between three numerical variables at once.

 

Plain-English example

“How Direction, Execution, and Results relate to each other.”

 

Best used when

  • Exploring complex relationships
  • Identifying clusters or outliers
  • Advanced analytical exploration
 

Best practices

  • Label axes clearly
  • Rotate or interact with the chart if possible
  • Use sparingly and explain the takeaway
 

Avoid when

  • Presenting to non-analytical audiences without explanation
  • Simpler 2D charts can do the job
 

3.3 Bar/Column Charts

Notion image
Notion image
 

What it shows

Compares values across categories.

This is the most familiar and most reliable chart type.

 

Plain-English example

“Which organization scored highest?”

“How do criteria compare side by side?”

 

Best used when

  • Comparing organizations, criteria, or groups
  • Ranking performance
  • Showing differences clearly and quickly
 

Best practices

  • Keep category labels readable
  • Sort bars when showing rankings
  • Use horizontal bars if labels are long
 

Avoid when

  • Showing trends over time (use a line chart)
  • Showing distributions (use a histogram)
 

3.4 Radar Chart

Notion image
 

What it shows

Performance across multiple dimensions on the same scale.

 

Plain-English example

“How balanced is performance across EFQM criteria?”

 

Best used when

  • Comparing strengths and weaknesses across dimensions
  • Showing balance or imbalance at a glance
  • Comparing one entity to a benchmark
 

Best practices

  • Use consistent scales across all axes
  • Limit comparisons (1–2 entities is ideal)
  • Pair with a short interpretation
 

Avoid when

  • You need precise numerical comparison
  • Comparing many entities at once
 

3.5 Line/Trend Chart

Notion image
 

What it shows

How values change over time or across an ordered sequence.

 

Plain-English example

“Did performance improve year over year?”

“Are results trending up or down?”

 

Best used when

  • Showing movement, progress, or change
  • Comparing trends across multiple groups
 

Best practices

  • Use time or logical order on the X-axis
  • Limit the number of lines to avoid clutter
  • Highlight key turning points
 

Avoid when

  • You’re comparing static categories
  • Time order isn’t meaningful
 

3.6 Heatmap

Notion image
 

What it shows

Intensity or strength of values using color.

 

Plain-English example

“Where are scores strongest or weakest across criteria and organizations?”

 

Best used when

  • Spotting patterns quickly
  • Comparing two dimensions at once
  • Highlighting outliers or clusters
 

Best practices

  • Use intuitive color scales (light → dark)
  • Include a legend
  • Explain what “high” and “low” mean
 

Avoid when

  • Exact values matter more than patterns
  • Color perception may cause confusion without labels
 

3.7 Dashboard

Notion image

What it shows

A curated collection of charts that tell a story together.

 

Plain-English example

“A one-page performance overview for leadership.”

 

Best used when

  • Monitoring performance
  • Supporting decision-making
  • Combining multiple perspectives into one view
 

Best practices

  • Keep it focused on one question or theme
  • Arrange charts in a logical reading flow
  • Add short annotations or summaries
 

Avoid when

  • You haven’t clarified the purpose
  • You’re trying to show everything at once
 

3.8 Quick Rule of Thumb

💡
  • Compare things → Bar chart
  • Track change → Line chart
  • Understand spread → Histogram
  • See balance → Radar chart
  • Spot patterns → Heatmap
  • Explore relationships → Scatter plot
  • Summarize everything → Dashboard
 

4. Best Practices

💡
  • Use clean, structured datasets
  • Start with data profiling before analysis
  • Ask one clear question at a time
  • Follow suggested next steps to build a logical flow
  • Run analyses incrementally rather than all at once
 

5. Known Limitations

💡

Occasionally:

  • Large analyses may take longer to complete
  • Long-running tasks may time out
  • Visual outputs may stop rendering in extended sessions
  • The agent can’t download a report in PDF or Word format for now.

If this occurs:

  • Rerun the analysis in smaller steps
  • Ask the agent to continue from the last completed point
  • Coy and paste report and visuals in your local file.
 
Did this answer your question?
😞
😐
🤩