Vicuna

Institutional-Grade Crypto Data

Clean, survivorship-free cryptocurrency pricing data. We spent years cleaning it so you can focus on research.

Why Clean Data Matters

Most cryptocurrency datasets suffer from survivorship bias: they only include tokens that are still actively trading, while failed, rugged, or delisted tokens are excluded entirely. This leads to misleading backtests and flawed research conclusions.

On top of that, raw exchange data is riddled with outliers, gaps, and inconsistencies. Cleaning this data properly takes years of dedicated work. We've already done it, and now we make it available to you.

Our Products

Historical Data

Comprehensive OHLCV (open, high, low, close, volume) bars at resolutions from 1 minute to 1 day across major cryptocurrency exchanges. Survivorship-bias-free, with full coverage of delisted assets, cleaned and ready for research.

  • OHLCV bars at 1-minute to 1-day resolution
  • Survivorship-bias-free dataset
  • Outlier detection and removal
  • Gap filling and consistency checks
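
To make the last two bullets concrete, here is a minimal sketch of what outlier flagging and gap filling can look like on 1-minute bars. It is an illustration only and not a description of Vicuna's actual cleaning pipeline: the function names, the rolling-median rule, the 20% threshold, and the choice to fill gaps with flat zero-volume bars are all assumptions made for this example.

```python
from statistics import median

def flag_outliers(closes, window=5, threshold=0.2):
    """Return indices of closes that deviate from the median of their
    neighbours by more than `threshold` (as a fraction of that median).
    Hypothetical rule chosen for illustration."""
    flagged = []
    for i, price in enumerate(closes):
        lo = max(0, i - window)
        hi = min(len(closes), i + window + 1)
        neighbours = closes[lo:i] + closes[i + 1:hi]  # exclude the bar itself
        if not neighbours:
            continue
        ref = median(neighbours)
        if ref > 0 and abs(price - ref) / ref > threshold:
            flagged.append(i)
    return flagged

def fill_gaps(bars, step=60):
    """bars: list of (timestamp, open, high, low, close, volume) tuples,
    sorted by timestamp in seconds. Missing timestamps are filled with a
    flat, zero-volume bar at the previous close (an assumed convention)."""
    if not bars:
        return []
    filled = [bars[0]]
    for bar in bars[1:]:
        last_close = filled[-1][4]
        t = filled[-1][0] + step
        while t < bar[0]:
            filled.append((t, last_close, last_close, last_close, last_close, 0.0))
            t += step
        filled.append(bar)
    return filled

closes = [100.0, 101.0, 100.0, 500.0, 99.0, 100.0, 101.0]
outliers = flag_outliers(closes)

raw_bars = [
    (0, 100.0, 101.0, 99.0, 100.0, 5.0),
    (60, 100.0, 102.0, 100.0, 101.0, 3.0),
    (240, 101.0, 101.5, 100.5, 101.0, 4.0),  # bars at 120 and 180 are missing
]
complete_bars = fill_gaps(raw_bars)
```

In this toy input the 500.0 print is the only close flagged, and the two missing minutes are filled with bars that carry the previous close and zero volume, so downstream code sees a regular time grid.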

Real-Time Data

Live OHLCV bars delivered in the same clean format as our historical data. Seamless integration for institutions that need production-grade pricing data without the cleaning overhead.

  • Same clean format as historical data
  • Real-time cleaning and validation
  • Consistent across all supported exchanges
  • Ready for live production systems

Why Vicuna Data

Survivorship-Free

Without survivorship-bias-free data, a backtest surfaces patterns that look profitable but are really just traits every surviving token shares. Rugged and delisted tokens showed those same patterns before they failed, so a strategy built on them can lose everything. Our datasets include all delisted and failed assets to prevent this.
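
A toy calculation shows how large the effect can be. The tokens and returns below are invented for illustration and do not come from any real dataset; the point is only that dropping failed tokens from the universe changes the sign of the result.

```python
# Hypothetical universe: total return over a backtest window.
# A return of -1.0 means the token went to zero.
universe = {
    "AAA": {"ret": 0.40, "delisted": False},
    "BBB": {"ret": 0.10, "delisted": False},
    "CCC": {"ret": -1.00, "delisted": True},
    "DDD": {"ret": -0.90, "delisted": True},
}

def equal_weight_return(tokens):
    """Average return of an equally weighted basket."""
    return sum(t["ret"] for t in tokens) / len(tokens)

# Survivors only: what a dataset without delisted tokens would show.
survivors = [t for t in universe.values() if not t["delisted"]]
biased = equal_weight_return(survivors)

# Full universe: what an investor holding every token actually earned.
unbiased = equal_weight_return(list(universe.values()))

print(f"survivors only: {biased:+.2%}")
print(f"full universe:  {unbiased:+.2%}")
```

The survivors-only basket averages a gain of 25%, while the full universe averages a loss of 35%. The same strategy evaluated on the two datasets leads to opposite conclusions.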

Years of Cleaning, Done for You

Cryptocurrency pricing data is notoriously messy: outliers, gaps, exchange inconsistencies. We spent years cleaning it. By using our data, you save yourself that time and all the problems that come with it.

Institutional Quality

Built by quant researchers who rely on this data daily. Every data point is validated against multiple sources and cross-checked for accuracy.

Self-Collected

We collect all data ourselves from public sources, keeping the entire collection process survivorship-bias-free from the start. No third-party data vendors, no inherited biases.

Multi-Exchange Coverage

Unified pricing data from Binance, ByBit, and OKX, with more exchanges coming soon. Consistent format across all venues for seamless integration.

Research-Ready Format

Data is delivered in a standardized, consistent format across all exchanges and timeframes. Load it directly into Python, R, or any analytics platform without additional preprocessing.
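
As a rough sketch of what loading such a file into Python might look like, the example below parses a small CSV with the standard library and runs a basic OHLC sanity check. The column names, the timestamp format, and the sample rows are assumptions made for this example; the actual delivered schema may differ.

```python
import csv
import io
from datetime import datetime, timezone

# Made-up sample rows; column names and timestamp format are assumptions.
SAMPLE_CSV = """timestamp,exchange,symbol,open,high,low,close,volume
2024-01-01T00:00:00Z,binance,BTCUSDT,42000.0,42050.0,41990.0,42010.0,12.5
2024-01-01T00:01:00Z,binance,BTCUSDT,42010.0,42030.0,42000.0,42020.0,8.0
"""

NUMERIC_FIELDS = ("open", "high", "low", "close", "volume")

def load_bars(fileobj):
    """Parse OHLCV rows into dicts with UTC timestamps and float values."""
    bars = []
    for row in csv.DictReader(fileobj):
        bar = {
            "timestamp": datetime.strptime(
                row["timestamp"], "%Y-%m-%dT%H:%M:%SZ"
            ).replace(tzinfo=timezone.utc),
            "exchange": row["exchange"],
            "symbol": row["symbol"],
        }
        for field in NUMERIC_FIELDS:
            bar[field] = float(row[field])
        bars.append(bar)
    return bars

def is_consistent(bar):
    """High must bound open/close from above, low from below; volume >= 0."""
    return (
        bar["low"] <= min(bar["open"], bar["close"])
        and bar["high"] >= max(bar["open"], bar["close"])
        and bar["volume"] >= 0
    )

bars = load_bars(io.StringIO(SAMPLE_CSV))
all_ok = all(is_consistent(b) for b in bars)
```

With a real file, `io.StringIO(SAMPLE_CSV)` would be replaced by an open file handle, for example `open(path, newline="")`.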

Exchange Coverage

We currently cover the largest cryptocurrency exchanges by volume, with more being added continuously.

  • Binance: Available
  • ByBit: Available
  • OKX: Available
  • More exchanges: Coming soon

Data Delivery

Multiple ways to access our data, designed to fit seamlessly into your existing workflow and infrastructure.

REST API

Programmatic access with real-time and historical endpoints. Integrate directly into your trading systems or research pipelines.

Excel / CSV Downloads

Scheduled or on-demand exports in standard formats, ready for analysis in Excel, Python, R, or any other tool.

SFTP Delivery

Automated file delivery to your secure server on a daily or custom schedule.

Custom Integration

Need a different format or delivery method? We work with you to build a solution that fits your infrastructure.

Ready to get started?

Whether you need clean pricing data for research or production, we'd love to hear from you.

Get in Touch