ACL Management for Spark SQL

ACL Management for Spark SQL

Three primary modes for Spark SQL authorization are available with spark-authorizer:

Storage-Based Authorization

Enabling Storage Based Authorization in the Hive Metastore Server uses the HDFS permissions to act as the main source for verification and allows for consistent data and metadata authorization policy. This allows control over metadata access by verifying if the user has permission to access corresponding directories on the HDFS. Similar with HiveServer2, files and directories will be tanslated into hive metadata objects, such as dbs, tables, partitions, and be protected from end user's queries through Spark SQL as a service like Kyuubi, livy etc.

Storage-Based Authorization offers users with Database, Table and Partition-level coarse-gained access control.

Please refer to the Storage-Based Authorization Guide in the online documentation for an overview on how to configure Storage-Based Authorization for Spark SQL.

SQL-Standard Based Authorization

Enabling SQL-Standard Based Authorization gives users more fine-gained control over access comparing with Storage Based Authorization. Besides of the ability of Storage Based Authorization, SQL-Standard Based Authorization can improve it to Views and Column-level. Unfortunately, Spark SQL does not support grant/revoke statements which controls access, this might be done only through the HiveServer2. But it's gratifying that spark-authorizer makes Spark SQL be able to understand this fine-grain access control granted or revoked by Hive.

For Spark SQL Client users who can directly acess HDFS, the SQL-Standard Based Authorization can be easily bypassed.

With Kyuubi, the SQL-Standard Based Authorization is guaranteed for the security configurations, metadata, and storage information is preserved from end users.

Please refer to the SQL-Standard Based Authorization Guide in the online documentation for an overview on how to configure SQL-Standard Based Authorization for Spark SQL.

Ranger Security Support

Apache Ranger is a framework to enable, monitor and manage comprehensive data security across the Hadoop platform but end before Spark or Spark SQL. The spark-authorizer enables Spark SQL with control access ability reusing Ranger Plugin for Hive MetaStore. Apache Ranger makes the scope of existing SQL-Standard Based Authorization expanded but without supporting Spark SQL. And spark-authorizer sticks them together.

Please refer to the Spark SQL Ranger Security Support Guide in the online documentation for an overview on how to configure Ranger for Spark SQL.

最后编辑于
©著作权归作者所有,转载或内容合作请联系作者
平台声明:文章内容(如有图片或视频亦包括在内)由作者上传并发布,文章内容仅代表作者本人观点,简书系信息发布平台,仅提供信息存储服务。

推荐阅读更多精彩内容

  • PLEASE READ THE FOLLOWING APPLE DEVELOPER PROGRAM LICENSE...
    念念不忘的阅读 13,566评论 5 6
  • 我经常能够看见,与很多人喜欢默默付出,帮助别人做了很多事却不声不响。就像很多电视剧里男二的桥段,总是默默守护着女生...
    247J阅读 701评论 0 0
  • 以一个业余爱好文化艺术之人的眼光来打量武汉这座城市,你会从心底里生出一种对这座城市的轻视,感觉它竟然长得四不象,只...
    芭比和剑客阅读 293评论 0 1
  • 第三方分享,今天聊的是友盟分享,官方链接:http://www.umeng.com/social 官方的SDK都能...
    软工官博阅读 969评论 0 0
  • 早上,上班前,这个店吃早餐,门都没有,冷死了。 孩爹的车过来的,他说饿了,吃了早餐再回家,我也还没吃,我平时都是包...
    喊哈是哈阅读 161评论 0 0