about the followingHBaseofBloomFilterCharacter understanding, which statement is incorrect?
existSpark, which of the following statements about broadcast variables is correct?
for running onMapReduceThe application on the platform that this application depends onjarpackage will be put where?
HDFSThe system time of the node where the client is located is the same as theFusionInsight HDThe system time of the cluster should be maintained
Consistent, if there is a time difference, then the time difference should be less than a few minutes?
existFusionInsight HDUnder the client, to runMapReducegenerated by the applicationjarBag. Which command can be executed?
existHBaseWhich of the following interfaces or classes does not need to be involved in the implementation of business logic for writing data?
aboutFlumeThe characteristics of the collected data, which of the following descriptions are correct?
Solris a high-performance, basedLucenefull-text search service.SolrrightLuceneexpanded,
provides a ratioLuceneA richer query language and a powerful full-text search function are implemented, with a high degree of reliability.
Extensibility. At the same time fromSolr 4.0Version starts, supportsSolrCloudmodel.
existSpark, the accumulator can realize high-speed parallel counter and variable summation; inSparkduring application development,
only inDriverGet the value of this counter on .
when aMapReduceWhen the application is executed, which of the following actions ismapoccurred before the stage?
FusionInsight HD in real-time processing scenarios, what computing frameworks are available? (multiple choice)
FusionInsight HDin, aboutHive UFDSecondary development, is the following description correct? (multiple choice)
in aMapReduceapplication,mapThe output of the function is viaMapReduce? ?After processing, send toreduceletter
number. This process belongs to? ?Sort and group pairs.
FusionInsight HD V100R002C60in, aboutHiveofPythonInterface type, which of the following descriptions is incorrect?
forSpark Streamingapplication, in aJVM, there can only be one at a timeStreamingContextactive condition.
existStreamingin application development,BoltUse which of the following interfaces to sendTuple?
existFusionInsight HDmiddle,FlumeWhich of the following are supportedsourceTypes of? (multiple choice)
FusionInsight HDmiddle,OozieBefore submitting the job, you need to upload the configuration files andjarpackage toHDFS
RedisofLISTData structure, suitable for which of the following scenarios? (multiple choice)
aboutStreamingthe topology (Topology), which of the following descriptions is wrong?
FusionInsight HDofHiveIn the application, there are the following scenarios:? ? ?Storage files have higher? ?efficiency, and most
Minute? ?Only a part of the letter is involved in the file, this scenario is suitable for using a column file (ORC F??)storage.
FusionInsight HD which components are provided externallySQLor classSQLability? (multiple choice)
Fusionlnsiht HDmiddle,Oozieclient'sJava APIwill be called when the task is runOozieClientWhich method of the class?
existSpark, assuminglinesIs anDStreamobject,filterStatements can be filtered out80%data for the following two
The correct statement is:
X: lines.filter(…).groupByKey(…)
Y: lines.groupByKey(…).filter(…)
FlumewriteHDFSWhen the file is generated, what are the ways of generating the file? (multiple choice)
A project needs to save the Internet access data in a certain area, and search the full text of these Internet access records to see if there is any sensitive data.
Sensitive information is used to prevent crimes in this area. In this scenario, which of the following options is the best?
Suppose there is an application that needs to be accessed frequentlyOracleThe user table in the database, in order to improve performance, introduceRedisto cache
account information.
For this scene,RedisWhich of the following is the best data structure choice for ?
FusionInsight HDsystematicV100R002C60version,HiveOnly supports based onMapReduceEngine query service, not supported based onSparkEngine query service.
FusionInsight HDin, belonging toStreamingWhat are the roles of the service? (multiple choice)
existStreamingin application development,BoltUse which of the following interfaces to sendTuple?
aboutStreamingdisaster recovery capability, which of the following statements is correct? (multiple choice)
Fusionlnsigt HD the user wants to passHBase shelloperation to query aHBaseThe contents of the table, this scenario is pushed down
It is recommended that the administrator assign a machine account to this user.
FusionInsight HDin, aboutOozieWhich of the following descriptions is correct? (multiple choice)
forHBase rowkeyThe design principles described below are correct? (multiple choice)
FusionInsight HD V100R002C60in, aboutHiveofPythonInterface type, which of the following descriptions is incorrect?
HDFSThe system time of the node where the client is located is the same as theFusionInsight HDThe system time of the cluster should be consistent, if there is time
difference, then the time difference should be less than a few minutes?
FusionInsight HDin useStreamingofA, CKWhich of the following statements is true? (multiple choice)
RedisofLISTData structure, suitable for which of the following scenarios? (multiple choice)
FusionInsigt HDWhat distributed computing frameworks do big data platforms provide?
(multiple choice)
When the cluster is normal,RedisClient initiates oncegetCall, the client has () times of message interaction with the server?
RDDasSparkmost__________The core object has which of the following characteristics? (multiple choice)
when aMapReduceWhen the application is executed, which of the following actions ismapoccurs before the stage of?
Spark SQLIn the table, there are often many small files (the size is much smaller thanHDFSblock size), in this case,Sparkwill enable aTaskto process these small files, whenSQLexist in operationShufleWhen operating, will greatly increasehashThe number of dynamic buckets will seriously affect the performance.
aboutFusonInsight HDofSpark, which of the following programming languages can be used to developSparkapplication? (multiple choice)
due toSparkis a memory-based computing engine, therefore, aSparkThe amount of data the app can handle Can't give more than thisSparkThe total memory of the application.
FusionInsight HDWhich of the following belong toOozieofMapReduce Actionconfiguration item? (multiple choice)
InstallFusionInsight HDofStreamingcomponents,NimbusHow many nodes does the role require to install?
FHumeofproperties.propertiesMultiple configurations can be configured in the configuration filechannelto transmit data.
FusionInsight HD assuming a topology that setsspoutConcurrency is3,bolt1Concurrency
for2,bok2Well degree is3.workerThe number is2,Sobolt1ofexecutorexistworkerhow to divide Cloth?
FusionInsightHDin, aboutHivepartition (partation) function, which is wrong as described below?
FusionInsight HDWhich components in the platform support table and column encryption?
(multiple choice)
existKafkamiddle,ProducerThis can be done by configuring the synchronization parameters (producer.type), to ensure that the data press
existStreamingin application development,BoltThe class uses which of the following interfaces to sendTuple?
There are the following business scenarios: User online log files have been stored inHDFSabove, the log file content format the formula is: each online record has three fields, namely name, gender, and online time, and the fields are separated by ",";
It is required to print out all female netizens who spend more than two hours online. Which of the following code snippets can achieve
The above business scenario? (multiple choice)
aboutFusionInsight HDplatformHiveservice, itsWebHCatDevelopment interface, the following description does not the correct one is?
HBasetablerowkeyDesign is a very important development and design link. Suppose there is the following scenario,
The most frequent query scenario is to query the historical call records of each month and half a year based on the mobile phone number. Which of the followingrowkey
Design is optimal?
Spark Streamingavailable fromKafkaReceive data and perform calculations, and the calculation results can only be stored inHDFS,
can't write backKafka.
HadoonIn the system: ifHDFSThe backup factor of the file system is3,SoMapReduceevery time
In progresstaskall from3The segment of the file that needs to be processed is transmitted on a machine with a replica.
FusionInsight HDin, useStreamingofLinuxWhen submitting a topology in command line mode, you need to
first use a possessedStreamingUser with submit permissionkinitway authentication