大数据 OLAP 引擎:Presto 概述
![https://chenxqblog-1258795182.cos.ap-guangzhou.myqcloud.com/typora/presto_YGCO9br.png https://chenxqblog-1258795182.cos.ap-guangzhou.myqcloud.com/typora/presto_YGCO9br.png](https://chenxqblog-1258795182.cos.ap-guangzhou.myqcloud.com/typora/presto_YGCO9br.png)
Contents
大数据 OLAP 引擎:Presto 概述
SQL on Anything
presto特点
presto架构
Presto查询引擎是一个Master-Slave的架构,由一个Coordinator节点,一个Discovery Server节点,多个Worker节点组成,Discovery Server通常内嵌于Coordinator节点中。 Coordinator负责解析SQL语句,生成执行计划,分发执行任务给Worker节点执行。 Worker节点负责实际执行查询任务。Worker节点启动后向Discovery Server服务注册, Coordinator从Discovery Server获得可以正常工作的Worker节点。如果配置了Hive Connector,需要配置一个Hive MetaStore服务为Presto提供Hive元信息,Worker节点 与HDFS交互读取数据。
presto比hive快的原因
dag结构,memory to memory
应用场景
etl,adhoc查询
Docker 环境部署
docker pull alluxio/alluxio-presto-sandbox
docker run -d \ –shm-size 1G \ -p 19999:19999 \ -p 8080:8080 \ –name alluxio-presto-sandbox \ alluxio/alluxio-presto-sandbox
docker exec -it alluxio-presto-sandbox bash
presto –catalog hive –debug
或者通过下面的方式部署
https://hub.docker.com/r/ahanaio/prestodb-sandbox
参考资料:
- Presto官网:https://prestodb.io/ Presto下载安装包:https://prestodb.io/download.html
- Presto docker sandbox: https://hub.docker.com/r/ahanaio/prestodb-sandbox
- Presto与Alluxio集成Docker环境上手指南:https://www.alluxio.io/alluxiopresto-sandbox-docker/
- Presto安装配置文档: https://prestodb.io/docs/current/installation/deployment.html
- Presto与Hive集成文档:https://prestodb.io/docs/current/connector/hive.html
- Presto SQL语法文档:https://prestodb.io/docs/current/sql.html