IndexR Logo

IndexR is a super fast columnar data format on HDFS, which focus on fast analytic, both for massive static(historical) data and rapidly ingesting realtime data. IndexR is designed for OLAP. IndexR is greatly suitable for building data warehouse based on Hadoop ecosystem.

  • Super fast, 2~4x read speed of Parquet.
  • 3 levels indices supported. Say goodbye to full scan.
  • Support realtime ingestion. No more wait, analyse anything right after they happen.
  • Hardware efficiency, anyone can use.
  • Features like realtime and offline pre-aggregation, online schema update, 100% accurate, etc.
  • Deep integration with Hadoop ecosystem. Adapted with popular query engines like Apache Drill, Apache Hive, etc.

Getting started


Please feel free to file any issues.


  • WeChat: xilyflow
  • QQ Group: 606666586 (IndexR讨论组)


Copyright 2016 Sunteng Tech.

Licensed under the Apache License, Version 2.0 (the "License"); you may not
use this file except in compliance with the License. You may obtain a copy of
the License at

Unless required by applicable law or agreed to in writing, software
distributed under the License is distributed on an "AS IS" BASIS, WITHOUT
WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. See the
License for the specific language governing permissions and limitations under
the License.


An open-source columnar data format designed for fast & realtime analytic with big data.

Indexr Info

⭐ Stars 443
🔗 Source Code
🕒 Last Update 8 months ago
🕒 Created 6 years ago
🐞 Open Issues 10
➗ Star-Issue Ratio 44
😎 Author shunfei