Your company stores user profile records in an OLTP database. You want to join these records with web server logs you have already ingested into the Hadoop file system.
What is the best way to obtain and ingest these user records?
Apache Spark, included in Alibaba E-MapReduce (EMR), is a fast and general-purpose cluster computing
system. It provides high-level APIs in Java, Scala, Python and R, and an optimized engine that supports
general execution graphs. It also supports a rich set of higher-level tools. Which of the following tools
is not included in Spark?
Score 2
Scenario: Jack is the administrator of project prj1. The project involves a large volume of
sensitive data, such as bank accounts and medical records. Jack wants to properly protect
the data. Which of the following statements is necessary?
Which node type in DataWorks lets you edit Python code to operate on data in MaxCompute?
Score 2
Which of the following is not a proper way to grant permission on an L4 MaxCompute table to a
user? (L4 is a level in MaxCompute label-based security (LabelSecurity), which is a mandatory
access control (MAC) policy at the project space level. It allows project administrators to control
user access to column-level sensitive data with improved flexibility.)
Score 2
DataV is a powerful yet accessible data visualization tool, which features geographic information
systems allowing for rapid interpretation of data to understand relationships, patterns, and trends.
When a DataV screen is ready, it can be embedded into the enterprise's existing portal through
______.
Score 2
When odpscmd is used to connect to a project in MaxCompute, the command ______ can be
executed to view the amount of storage space occupied by the table table_a.
Score 2
Which of the following task types does DataWorks support?
(Number of correct answers: 4)