Question 1

What is Savvina AI?

Accepted Answer

Savvina AI is a self-hosted, natural language analytics tool. You connect it to your database, ask questions in plain English, and it generates and runs the SQL — then returns clean results and charts. No SQL knowledge required.

Question 2

Who is it for?

Accepted Answer

Anyone who needs answers from a database but doesn't write SQL — executives, finance teams, product managers, sales, HR, and ops. It's also useful for data teams who want to offload routine lookups.

Question 3

Is there a cloud version?

Accepted Answer

Not yet. A cloud-hosted edition is planned for a later date. Today, Savvina runs entirely on your own infrastructure — on-prem, in your VPC, or on AWS / Azure / GCP.

Question 4

Which databases does Savvina support?

Accepted Answer

The Community Edition supports PostgreSQL and MySQL/MariaDB. The commercial edition adds MongoDB, Snowflake, BigQuery, Redshift, Databricks, ClickHouse, SQL Server, Athena, Trino, and object storage sources (S3, GCS, MinIO, R2, Azure Blob).

Question 5

Which LLM providers are supported?

Accepted Answer

Claude (Anthropic), OpenAI, Groq, Google Gemini, Cerebras, Mistral, Ollama, and any OpenAI-compatible endpoint — including GitHub Models, HuggingFace, Together.ai, OpenRouter, vLLM, and LM Studio. You can switch providers per session.

Question 6

Can I run it completely offline?

Accepted Answer

Yes. Using Ollama with a locally pulled model (e.g. llama3 or qwen2.5-coder), Savvina runs with zero internet egress. No data ever leaves your machine.

Question 7

Do I need to pay for an LLM API?

Accepted Answer

Not necessarily. Groq offers 14,400 free requests per day, and Google Gemini offers 1,500 free requests per day — either is enough to get started at no cost.

Question 8

Does my data get sent to the LLM?

Accepted Answer

Never. Only your schema description and your natural language question are sent to the LLM. Query results stay entirely within your infrastructure.

Question 9

Can I control what the LLM sees?

Accepted Answer

Yes, with column-level precision. You can exclude specific schemas, tables, or columns entirely. Sensitive patterns like email, ssn, and password are auto-flagged. You can also toggle whether sample values, column comments, or row counts are included in the prompt.

Question 10

Is it safe to run against a production database?

Accepted Answer

Savvina only ever issues SELECT statements. Every generated query is validated before execution — writes are not possible by design.

Question 11

How are credentials stored?

Accepted Answer

Database credentials and LLM API keys are encrypted at rest using Fernet (AES-128) before being stored. They never appear in logs or backups.

Question 12

What does the audit log record?

Accepted Answer

The audit log captures who ran a query, when, and which model was used — but never the actual data values returned.

Question 13

How long does it take to get running?

Accepted Answer

About 15 minutes. Clone the repo, copy .env.example, generate three secrets (Fernet key, JWT secret, DB passwords), add a free LLM API key, run docker compose up --build, and open https://localhost:3000.

Question 14

What are the minimum requirements?

Accepted Answer

Docker + Docker Compose v2, and at least 4 GB RAM (8 GB recommended if running Ollama locally).

Question 15

Can I try it without a real database?

Accepted Answer

Yes. The repo includes pre-seeded demo PostgreSQL and MySQL databases you can spin up with docker compose --profile test-dbs up --build.

Question 16

Does Savvina execute queries automatically?

Accepted Answer

It depends on your configuration. There are three execution modes: Auto-execute (runs immediately), Review-first (shows the SQL for approval before running), and Generate-only (returns the SQL only, for a DBA to run manually). You set the mode per connection.

Question 17

How does accuracy improve over time?

Accepted Answer

Every thumbs-up on a result is added to a few-shot example library. The next time a similar question is asked, that example is included in the prompt — so accuracy compounds with use, without any manual tuning.

Question 18

Does Savvina cache queries?

Accepted Answer

Yes. A two-level cache combines exact-match and semantic-similarity matching. In production, this typically reduces LLM calls by 40–60%, cutting both latency and inference cost.

Question 19

Is Savvina free?

Accepted Answer

The Community Edition is free for development, testing, and non-commercial use under the Business Source License 1.1. Production or commercial use requires a commercial license. On June 1, 2030, the project converts to Apache 2.0.

Question 20

When will the production edition be available?

Accepted Answer

The self-hosted commercial edition is coming soon. You can join the waitlist at savvina.ai or contact info@savvina.ai for more information.

Question 21

Is there per-query pricing?

Accepted Answer

No. Savvina has no query metering. You bring your own LLM key and pay only for whatever that provider charges — or nothing at all if you use a free tier or Ollama.

Question 22

What is the Semantic Model?

Accepted Answer

The Semantic Model is a business-context layer that sits between your raw database schema and the LLM. Without it, the LLM sees column names like cx_tp_cd or ord_sts and has to guess what they mean. With it, the LLM knows exactly that ord_sts = 'C' means Confirmed, that total_amount should always be summed and formatted as currency, that one row in orders represents one order, and that cancelled orders should be excluded from revenue by default. It's the difference between a model that produces plausible-looking SQL and one that produces correct SQL for your specific data.

Question 23

How is the Semantic Model generated?

Accepted Answer

After connecting a database and refreshing the schema, you trigger generation from the Semantic Model tab. Savvina runs a three-phase pipeline: it introspects your schema, sends batches of up to 4 tables at a time to the LLM for annotation, then makes a final call to generate cross-table business metrics, derived columns, join patterns, and named segments. The whole process takes 10–60 seconds depending on schema size.

Question 24

What exactly does it contain?

Accepted Answer

The model is rich. For each table it stores a human-readable display name, a plain-English description, a grain statement (what one row represents), default SQL filters that are always applied, the primary timestamp and date columns, partition and cluster hints, and hierarchies for drill-down. For each column it adds a display name, a semantic type (monetary, identifier, status flag, etc.), cardinality, value mappings for coded fields, sensitive flags, and the correct aggregation function. At the cross-table level it defines named Business Metrics, Derived Columns with exact SQL expressions, Segments, Relationships, and Time Expressions.

Question 25

What is "Check Drift" and when should I use it?

Accepted Answer

Drift detection compares your saved Semantic Model against the live schema — no LLM call, no database query, just a structural comparison. It will warn you about tables or columns that have been removed since generation, and flag any Business Metrics that reference tables which no longer exist. Run it after any schema migration or significant schema change. It does not auto-fix the model; it only tells you what's out of sync so you can decide whether to regenerate or patch manually.

Question 26

If I regenerate, do I lose my edits?

Accepted Answer

Yes. Regeneration replaces the entire model. If you've spent time refining metric definitions and value mappings, export or note them before regenerating. After a minor schema change (adding a new table or a few columns), it's usually better to add or edit only the affected parts manually and run Check Drift, rather than regenerating the whole model.

Question 27

What parts of the model matter most for query accuracy?

Accepted Answer

In order of impact: default filters on tables (these enforce domain constraints on every query), Business Metric filters (make sure status values match your actual data), value mappings on status and type columns (prevent the LLM from hallucinating values that don't exist), and grain statements (prevent incorrect fan-out joins). Descriptions and display names help with question understanding but have less impact on correctness.

Question 28

What happens if my schema is very large and the model doesn't fit in the LLM's context window?

Accepted Answer

Savvina compresses the prompt progressively before dropping content. First, few-shot examples are removed. Then the schema is compacted (per-table DDL and relationships are stripped, but Business Metrics, Segments, and Time Expressions are preserved). If it's still too large, the full semantic model is dropped. To mitigate this: use a provider with a larger context window (Claude or GPT-4o), exclude unused schemas and tables via connection privacy settings, and keep descriptions concise. Business Metrics and Segments survive compression the longest — prioritise filling those in if you can only do part of the model.

Question 29

What is the Example Library?

Accepted Answer

The Example Library is a few-shot learning system that accumulates question–SQL pairs over time. Every time a user gives a thumbs-up to a result, that question and the SQL Savvina generated for it are stored and embedded. On future queries, semantically similar past examples are retrieved and included in the LLM prompt as demonstrations — showing the model exactly how your questions map to correct SQL for your specific schema.

Question 30

How does it improve accuracy over time?

Accepted Answer

Out of the box, Savvina generates SQL from the schema and Semantic Model alone. After a few weeks of use and positive feedback, the prompt also contains 3–5 worked examples of questions your team has actually asked, with the SQL that produced the right answers. This compounds: the more questions get a thumbs-up, the more precisely the LLM can handle edge cases, ambiguous phrasing, and your organisation's particular terminology.

Question 31

What triggers an example being added?

Accepted Answer

Only explicit thumbs-up feedback from a user. Neutral sessions and thumbs-down do not add to the library. This means the library only learns from verified-correct answers — low-quality feedback doesn't pollute it.

Question 32

What happens with a thumbs-down?

Accepted Answer

Thumbs-down does not remove an existing example from the library. It is primarily a signal for you to investigate and, if needed, correct the Semantic Model that led to the wrong result.

Question 33

Can I add examples manually?

Accepted Answer

Yes — through both the UI and the API. Under Settings → Examples Library, there's a "Verified Examples Library" form where you can type a natural language question and its corresponding SQL query, then click Add Example. This is especially useful for pre-seeding the library with your most common or business-critical questions before users start generating organic feedback. You can also use the API for bulk seeding.

Question 34

Is the Example Library shared across users on the same connection?

Accepted Answer

Yes. Examples are stored per-connection, so positive feedback from any user on a shared connection benefits everyone querying that database.

Question 35

Do examples replace the Semantic Model?

Accepted Answer

No — they complement it. The Semantic Model defines the rules; the Example Library demonstrates how those rules play out for real questions. In the event of a very large schema, examples are the first thing dropped from the prompt to make room (the Semantic Model is considered higher priority). For best results, invest in the Semantic Model first and let the Example Library grow naturally through use.

Frequently asked questions

General

Data Sources & LLMs

Privacy & Security

Setup & Deployment

Execution Modes & Accuracy

Pricing & Licensing

The Semantic Model

The Example Library

Still have questions?