ClickHouse vs. Elasticsearch FAQ | Answers to Common Questions

link管理

链接快照平台

输入网页链接，自动生成快照
标签化管理网页链接

相关文章推荐

豪气的墨镜 · Multiple problems ...· 1 周前 ·

怕考试的日记本 · 就用集显行不行？Radeon ...· 1 月前 ·

安静的斑马 · 奇漫屋漫画下拉式古风漫画在线看(Alicea ...· 2 月前 ·

暗恋学妹的红金鱼 · makefile 函数 - 下夕阳 - 博客园· 2 月前 ·

彷徨的酱牛肉 · 天正T20 ...· 4 月前 ·

坐怀不乱的椰子 · C#S7.NET实现西门子PLCDB块数据采 ...· 6 月前 ·

Altinity.Cloud

Managed service for ClickHouse in any AWS, GCP, and Azure region or your own VPC

Support for ClickHouse

Get 24/7 Support or POC and Evaluative Support

Training for ClickHouse

Altinity Administrator Training

Pricing for managed ClickHouse

Calculator for managed service for ClickHouse

Customer stories

See why our customers love us

Altinity Plans and Features

Managed service or support? Compare plans and features

The key trade-off is highly compressed, columnar storage vs. full-text search on documents.

ClickHouse is a SQL data warehouse designed for extremely fast queries on large datasets. It uses columnar storage, compression, parallel query, and materialized views to reduce response time by a factor of 1000 over databases like MySQL or PostgreSQL. ClickHouse stores data in tabular format with a sparse “primary key” index and a predefined table order. It can define new columns and change column format instantly. Changing the primary key or order columns generally requires the table to be reloaded.

Elasticsearch is a search engine designed to run queries on documents containing semi-structured data like JSON log records. It implements full text search, which allows it to query document data without converting to tables. Elasticsearch is based on Lucene , which provides indexed storage. Elasticsearch requires users to set fixed types to index data; or Elasticsearch can dynamically map data types itself. Either way, you must reindex to change data types once they are set.

ClickHouse is released under the popular open source Apache 2.0 License . Users are free to add ClickHouse to proprietary products, use it to build SaaS offerings, and run managed ClickHouse services. There are no limitations to the type of business you can support.

Elasticsearch originally released under Apache 2.0 but has moved away. Users have a choice of the proprietary Elastic License v2 or the Server Side Public License (SSPL) 1.0 . Both licenses ensure access to source code but place significant limitations on usage of Elasticsearch, especially for SaaS businesses.

Uber moved to ClickHouse from Elasticsearch to manage service logs at massive scale. ContentSquare successfully migrated web analytics processing from Elasticsearch to ClickHouse.

ClickHouse excels in many other use cases as well: rapid valuation of financial assets, network flowlog analysis, intrusion detection, real-time marketing, CDN management, and observability applications, to name a few. ClickHouse performance in these use cases meets or exceeds any other analytic database.

ClickHouse performance is a clear winner over Elasticsearch for tabular or near-tabular data. Uber measurements show ingest time into ClickHouse was capped at 1 minute max and multi-region queries returned in seconds without degradation under high load. ContentSquare found that queries were 4 times faster overall and 10 times faster at 99th percentile latencies. Alibaba performance tests showed that ClickHouse outperformed Elasticsearch –often substantially–across a range of use cases.

Uber reduced their cluster footprint on ClickHouse by over 50% while serving more queries than with Elasticsearch. ContentSquare reported that ClickHouse was 11 times cheaper than Elasticsearch. ClickHouse runs well even on very small devices, such as Intel NUCs , where it can handle datasets running to hundreds of billions of records.

ClickHouse Apache 2.0 licensing enables a worldwide market for managed ClickHouse in public clouds. There are many such services including our own Altinity.Cloud in AWS and GCP. Users can easily move back and forth between managed ClickHouse and on-prem operation.

Elasticsearch licensing prevents vendors other than Elastic from running managed Elasticsearch for current software versions. Competitors must fork the older, Apache 2.0-licensed version, as Amazon has done . Managed Elasticsearch services may diverge in future, and compatibility with on-prem versions cannot be guaranteed.

ClickHouse security has improved rapidly over the past couple of years. It now offers LDAP user management , role-based access control (RBAC) , Kerberos authentication , column encryption functions , high-performance TLS connections, and many other features. Altinity offers FIPS-compatible ClickHouse builds and works with customers to enable deployment into FedRAMP environments .

Grafana is a popular choice for building dashboards quickly on ClickHouse data. The community-supported plugin is stable and widely used. Superset is another popular choice. ClickHouse client library support is excellent, which makes it easy to embed analytics in Javascript, Python, Golang, and Java applications.

ClickHouse has a number of popular options for loading log data including Vector , FluentD , and Kafka . You can also import log data directly using ClickHouse table functions , which can read from files, S3 object storage, HDFS, and other sources. Here’s an example of loading compressed file system data using the file table function .

INSERT INTO mytable 
  SELECT * FROM file('logfile.log.gz', 'Template', 'col1 …, colN …');

ClickHouse generally gets the best performance on data stored in table columns with proper data types. Scans on unstructured data are more expensive.

One popular ClickHouse pattern is to store the original unstructured doc in a table and extract enough attributes into columns to cover the majority of queries. PixelJets documented how ClickHouse makes it easy to extract data from common formats like JSON into high performance columnar format. ClickHouse also has a JSON datatype. It works efficiently for simple JSON documents whose schema does not vary.

More importantly, ClickHouse offers skipping indexes (ngrambf_v1, tokenbf_v, bloom filter , etc.), which can reduce I/O by 95% or more. In addition, it’s unbeatable in full scans thanks to features like compression, vectorized query processing, and efficient distributed query. In specific use cases like log search, these features outperform Elasticsearch full-text indexing and also use far less storage space.

Running ClickHouse on Altinity.Cloud cut our costs by 25% and boosted our performance. Even better, we can also store data with higher granularity with ClickHouse–down to mouse click moves–than we could have done with Elasticsearch. We can show aggregate results to users and let them drill down into detailed source data for deeper, more insightful analysis than was previously possible.
Software engineer, Product Analytics Company

Privacy Policy | Site Terms | Security | Legal | 2001 Addison Street, Suite 300, Berkeley, CA, 94704, United States | ©2022 Altinity Inc. All rights reserved. Altinity®, Altinity.Cloud®, and Altinity Stable® are registered trademarks of Altinity, Inc. ClickHouse® is a registered trademark of ClickHouse, Inc.; Altinity is not affiliated with or associated with ClickHouse, Inc. Kubernetes, MySQL, and PostgreSQL are trademarks and property of their respective owners.

To provide the best experiences, we use technologies like cookies to store and/or access device information. Consenting to these technologies will allow us to process data such as browsing behavior or unique IDs on this site. Not consenting or withdrawing consent, may adversely affect certain features and functions.

The technical storage or access that is used exclusively for statistical purposes. The technical storage or access that is used exclusively for anonymous statistical purposes. Without a subpoena, voluntary compliance on the part of your Internet Service Provider, or additional records from a third party, information stored or retrieved for this purpose alone cannot usually be used to identify you.