Context Orchestration

Context is the foundation — SignalPilot’s agentic harness aggregates information from multiple sources to give AI agents the full picture of your data environment.

Context orchestration is what separates SignalPilot from general-purpose AI assistants like ChatGPT or IDE copilots. The AI doesn’t just see your code — it sees your data, your transformations, your discussions, and your institutional knowledge.

What is Context?

Context is all the information the AI needs to understand your request and provide accurate, relevant assistance:

Notebook State

Cell code and outputs
Kernel variables and dataframes
Execution history
Markdown documentation

Data Sources

Local files (CSV, Excel, Parquet)
Database schemas and metadata
Table relationships
Query performance history

Transformations

dbt models and lineage
SQL transformations
Documentation and tests
Column-level lineage

Institutional Knowledge

Slack discussions about data
Jira tickets and requirements
Past investigation notes
Known data quirks

How Context Works Across Modes

Different agent modes use context differently:

Agent Mode
Hands-On Mode
Ask Mode

Context collection: AutomaticWhat the AI sees:

All notebook cells and outputs
Full kernel state (variables, dataframes)
All database connections
All MCP server connections (dbt, Slack, Jira)

Control: You approve the plan upfront, then the agent has full context to executeBest for: Complex investigations where you want the AI to have complete information

The Context Aggregation Stack

SignalPilot uses a multi-layered approach to gather context:

Layer 1: Notebook Context

Always available — Local notebook state

Cell code: All code cells and their contents
Cell outputs: Results, dataframes, visualizations
Kernel state: Active variables, imported libraries
Execution order: Which cells ran, when, and with what results

This is the baseline context available in all modes.

Layer 1.5: Local Files

Indexed automatically — CSV, Excel, Parquet, and other data files

Default indexed folders:
- ~/SignalPilotHome/data/ (home workspace)
- ./data/ (current working project directory)
@mention files: Type @ in chat to reference any indexed file
Add folders: Use the File Scanner tool in the left sidebar

This is the fastest way to get started — drop files in data/ and @mention them in chat.

Layer 2: Data Warehouse Context

Configured connections — Database metadata and query history

Schema information: Tables, columns, data types
Relationships: Foreign keys, joins, dependencies
Performance metrics: Query execution times, table sizes
Query history: Past queries and their patterns

Connect your data warehouse in Database Integrations to enable this layer.

Layer 3: Transformation Context (via MCP)

dbt integration — Model lineage and documentation

Model definitions: SQL transformations
Column-level lineage: Where data comes from and goes to
Tests and assertions: Data quality rules
Documentation: Model and column descriptions

Configure dbt

Connect dbt Cloud or Core for transformation context

Layer 4: Collaboration Context (via MCP)

Slack, Jira, Notion — Institutional knowledge and discussions

Slack threads: Relevant discussions about data or analyses
Jira tickets: Requirements, known issues, feature requests
Notion docs: Team documentation and procedures
Past investigations: Previous analysis notebooks

MCP Servers

Learn how to connect collaboration tools

Context Management in Practice

Automatic Context (Agent Mode)

In Agent Mode, SignalPilot automatically aggregates all available context:

User: "Why did revenue drop 8% last week?"

SignalPilot gathers:
✓ Revenue dataframe in current notebook (Layer 1)
✓ Schema for revenue_daily table in Snowflake (Layer 2)
✓ dbt model that creates revenue_daily (Layer 3)
✓ Slack thread about A/B test launched last week (Layer 4)

Agent responds:
"I see you have revenue_daily loaded. I also found a Slack discussion
mentioning an A/B test that started last Tuesday. Let me investigate
both the data drop and potential A/B test impact..."

Explicit Context (Hands-On Mode)

In Hands-On Mode, you control what context the AI can access:

Select Cells
@Mention Context
Grant Permissions

Checkbox selection:

Click checkboxes next to cells to add them to context
Only selected cells are visible to AI
Unselect to remove from context

Use case: Focus AI on specific analysis without exposing exploratory work

In chat, type @ to mention:

@sales.csv — Reference local files from data/ folder
@revenue_data — Share specific dataframe from kernel
@cell-5 — Reference specific cell
@postgres-prod — Connect to specific database
@dbt-models — Access dbt lineage

Default indexed folders:

~/SignalPilotHome/data/ (home workspace)
./data/ (current working project directory)

Add more folders: Use the File Scanner tool in the left sidebar to index additional directories.Use case: Surgical precision for context sharing

Context and MCP (Model Context Protocol)

SignalPilot uses MCP to connect to external context sources:

What is MCP?

MCP (Model Context Protocol) is a standard for connecting AI agents to external data sources. Instead of copy-pasting information, the AI can directly query:

dbt Cloud/Core for model lineage
Slack for relevant discussions
Jira for tickets and requirements
Database metadata servers
Custom internal tools

MCP Deep Dive

Learn how MCP works and how to configure servers

Internal vs External MCP

Internal MCP Sidecar (automatically available):

Kernel operations (execute code, introspect state)
Database queries (schemas, performance, history)
Local files (notebooks, CSVs, configs)

External MCP Servers (you configure):

dbt Cloud/Core
Slack
Jira
Notion
Custom tools via MCP specification

Privacy and Security

MCP connections are:

Read-only by default
Explicitly authorized by you
Zero data retention (SignalPilot never stores data)
Local-first (queries run from your environment)

You control:

Which MCP servers to connect
What permissions to grant
When to allow external context

Security Details

Read about SignalPilot’s security architecture

Viewing Active Context

SignalPilot shows your current context in real-time:

Open Context Panel

Click the Context tab in the SignalPilot sidebar to see:

Selected cells
Active dataframes
Connected databases
MCP server connections

Review What's Shared

The panel shows exactly what the AI can see:

✓ Green checkmarks for active context
○ Gray circles for available but not shared
🔒 Lock icons for permission-required resources

Modify Context

Click items to add/remove them from context:

Toggle cell selection
Disconnect databases
Revoke MCP permissions
Clear all context to start fresh

In Hands-On Mode, review the Context Panel before each request to ensure you’re sharing exactly what you intend.

Context Best Practices

Start minimal, expand as needed

Don’t: Share everything upfront “just in case”Do: Start with the minimum context needed, then add more if the AI needs itWhy: Focused context leads to more relevant, faster responses. The AI won’t get distracted by unrelated information.

Use @mentions for precision

Instead of: Selecting all cells that mention revenue_dataDo: Type @revenue_data in chat to explicitly share that dataframeWhy: @mentions are more explicit and easier to track than cell selection

Review before sharing sensitive data

Use Ask Mode to explore context needs

Workflow:

Start in Ask Mode: “What context would help analyze this issue?”
AI explains what it would need
Switch to Hands-On Mode and share that specific context
AI provides targeted assistance

Why: Lets you understand context requirements before sharing anything

FAQ

Does the AI remember context between sessions?

Within a session: Yes, context persists as long as the notebook is open and the kernel is running.Between sessions: No, context resets when you close the notebook. This is by design for privacy.Exception: Plans and notebook state are saved, so the AI can resume work where you left off.

Can I share context from one notebook to another?

What if I accidentally share sensitive data?

How much context is too much?

Signs of too much context:

Slow response times (processing large context)
Irrelevant suggestions (AI distracted by unrelated info)
Generic responses (AI overwhelmed by options)

Solution: Reduce context to the minimum needed for your current task. You can always add more later.

Can I save context configurations?

Not yet, but this is on the roadmap. For now:

Document your typical context needs in a markdown cell
Create templates with common cell selections
Use @mention patterns consistently

Next Steps

Chat Interface

Learn how to use chat to work with context

Planning

See how planning uses context for multi-step tasks

Available Tools

Discover what tools the AI can use with your context

MCP Overview

Connect external context sources via MCP

​Context Orchestration

​What is Context?

Notebook State

Data Sources

Transformations

Institutional Knowledge

​How Context Works Across Modes

​The Context Aggregation Stack

Configure dbt

MCP Servers

​Context Management in Practice

​Automatic Context (Agent Mode)

​Explicit Context (Hands-On Mode)

​Context and MCP (Model Context Protocol)

MCP Deep Dive

Security Details

​Viewing Active Context

​Context Best Practices

​FAQ

​Next Steps

Chat Interface

Planning

Available Tools

MCP Overview

Context Orchestration

What is Context?

How Context Works Across Modes

The Context Aggregation Stack

Context Management in Practice

Automatic Context (Agent Mode)

Explicit Context (Hands-On Mode)

Context and MCP (Model Context Protocol)

Viewing Active Context

Context Best Practices

FAQ

Next Steps