A telecommunications company plans to launch a big data business. The target services include customer grouping, historical bill analysis, real-time call charge analysis, and other services. Which of the following options is the most appropriate in terms of functionality and cost to meet business needs?
When a MapReduce program runs, a Map task malfunctions. Which of the following options describes this task correctly?
In the MRS interface, Loader can specify a variety of different data sources, configure data cleaning and conversion steps, configure cluster storage systems, etc.
According to the way data is acquired in Flume, Sources can be divided into driven Sources and ( ) Sources.
With the continuous increase in the volume of big data, the requirements for the physical security of data storage are getting higher and higher, and higher requirements are also put forward for the multiple copies of data and disaster recovery mechanism.
If the data producer needs to decide which task of the target Bolt the data is sent to, which of the following message publishing strategies should be selected?
Which of the following application scenarios can HBase be used for? (multiple choice)
When a certain task of MapReduce fails, the task can be recalculated through the retry mechanism.
When the Capacity Scheduler allocates resources, suppose there are two queues, Q1 and Q2, at the same level, each with a capacity of 30. Q1 has used 8 and Q2 has used 14, so resources will be allocated to Q1 first.
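The comparison behind this statement can be sketched as follows. This is a minimal illustration, assuming that among sibling queues the scheduler favors the one with the lowest ratio of used resources to configured capacity; the function name `pick_queue` and the dictionary layout are hypothetical and are not part of any YARN API.

```python
def pick_queue(queues):
    """Return the name of the sibling queue with the lowest used/capacity ratio.

    Assumption: the scheduler serves the least-utilized queue first.
    """
    return min(queues, key=lambda name: queues[name]["used"] / queues[name]["capacity"])


# Values taken from the statement: both capacities are 30, Q1 used 8, Q2 used 14.
queues = {
    "Q1": {"capacity": 30, "used": 8},
    "Q2": {"capacity": 30, "used": 14},
}

first = pick_queue(queues)
print(first)
```

Under that assumption, Q1 (8/30) is less utilized than Q2 (14/30), which matches the ordering given in the statement.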
YARN manages cluster resources through ResourceManager. What are its main functions?
In the Hive architecture, the ( ) component is responsible for reading, writing, and updating metadata such as tables, columns, and partitions.
When YARN performs resource scheduling, MapTasks and ReduceTasks run in ( ).
In the FusionInsight HD system, a Loader node in the cluster is abnormal. If other services are not abnormal, the normal use of the Loader service function will not be affected.
The overall process of Kafka Producer reading data is that the Producer connects to any surviving Broker, requests the leader metadata information of the specified topic and partition, and then directly connects with the corresponding Broker to publish the data.
Batch processing of high-value and highly aggregated information and knowledge is the main business requirement of the big data industry.
In the FusionInsight HD system, if dirty data is generated while the Loader job is running, the status of the Loader job execution result must be failed.
"Group by" in Hive refers to dividing each data set into several small data sets through certain rules, and then performing data grouping processing for several small data sets.
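The split-then-process idea in this statement can be sketched in plain Python. This is only an illustration of the grouping concept, not how Hive executes a query; the sample rows, the `dept` and `salary` field names, and the helper `group_by` are hypothetical.

```python
from collections import defaultdict


def group_by(rows, key):
    """Split rows into smaller data sets, one per distinct value of `key`."""
    groups = defaultdict(list)
    for row in rows:
        groups[row[key]].append(row)
    return dict(groups)


# Hypothetical sample data.
rows = [
    {"dept": "sales", "salary": 100},
    {"dept": "sales", "salary": 200},
    {"dept": "ops", "salary": 150},
]

# Step 1: divide the data set by the grouping rule.
groups = group_by(rows, "dept")

# Step 2: process each small data set, here by summing salaries per group.
totals = {dept: sum(r["salary"] for r in members) for dept, members in groups.items()}
print(totals)
```

The two steps correspond roughly to what a query such as SELECT dept, SUM(salary) ... GROUP BY dept expresses declaratively.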
Spark's intermediate data is stored in memory, which is more efficient and has higher latency for iterative operations and batch calculations.
Which of the following descriptions of the HBase secondary index is correct?
Which one of the following descriptions of the function of the HDFS NameNode is wrong?
In Hadoop, if yarn.scheduler.capacity.root.QueueA.minimum-user-limit-percent is set to 50, which of the following statements is wrong?
When installing the Streaming component of FusionInsight HD, how many nodes does the Nimbus role need to be installed on?
Hardware failure is considered the norm. To address this, HDFS is designed with a replication mechanism. By default, HDFS saves ( ) copies of a file.
When creating a Loader job, in which of the following steps can the filter type be set?
In the FusionInsight HD system, which of the following methods cannot be used to view the execution result of a Loader job?
What is the module used to manage the active and standby status of the Loader Server process in Loader?
In the Hadoop platform, to view the information of an application in the YARN service, what command is usually needed?
Which of the following descriptions about the basic operations of Hive SQL is correct?
In FusionInsight HD cluster planning, what kind of scenarios is the deployment solution that co-locates management nodes, control nodes, and data nodes suitable for?
SolrCloud mode is the cluster mode. In this mode, which of the following services does the Solr server strongly depend on?
In the Flink technical architecture, ( ) is a computing engine for stream processing and batch processing.
RDD has Transformation and Action operators. Which of the following belongs to the Action operator?
When planning and deploying a FusionInsight cluster, it is recommended to deploy ( ) management nodes, at least ( ) control nodes, and at least ( ) data nodes.
What component does HBase use by default as its underlying file storage system?
A user needs to build a FusionInsight HD cluster with 350 nodes. Which planning solution is the best?
Which one of the following descriptions of the order in which the YARN scheduler allocates resources is correct?
Which of the following descriptions of Kafka topics in the FusionInsight product are incorrect?
When Loader in FusionInsight HD imports files from an SFTP server, which of the following file types requires no encoding conversion or data conversion and is the fastest?
When Loader in FusionInsight HD creates a job, what is the function of the connector?
Which of the following is not included in the scheme authentication methods of ZooKeeper?
When deploying FusionInsight HD, how many FlumeServer nodes are recommended to be deployed in the same cluster?
Which of the following descriptions of basic Hive table creation operations is correct?
In the FusionInsight cluster, which of the following components does Spark mainly interact with?
When using Flume to transmit data, which of the following Channel types can be used to prevent data loss caused by a restart of the Flume process?
Which of the following destinations can FusionInsight HD Loader export HDFS data to?
Which of the following contents can be viewed in the Loader historical job record?
Which of the following designs are mainly considered in the planning process of the big data business consulting service plan?
What does this Hive command mean: "ALTER TABLE employee1 ADD COLUMNS (column1 string);"?
In the FusionInsight product, which statement is correct about the Kafka component?
In Huawei FusionInsight HD, which of the following components are strongly dependent on Flink?
Which of the following descriptions about the characteristics of Kafka Partition replicas is correct?
An HBase client writes 10 records in a batch. One HRegionServer node hosts 2 regions of the table, A and B. Of the 10 records, 6 belong to A and 4 belong to B. How many RPC requests must be sent to the HRegionServer to write these 10 records?
The following figure shows the computational model of Structured Streaming. By observation, it can be concluded that the final calculation result of 3 is
In Flink, the ( ) interface is used for streaming data processing, and the ( ) interface is used for batch processing.
Which of the following statements about CarbonData in FusionInsight is correct?
What types of operations does the following statement contain: SELECT aa.salary, bb.address FROM employee aa JOIN (SELECT address FROM employee_info WHERE province='zhejiang') bb ON aa.name = bb.name?
Which of the following data sources can exchange data with FusionInsight HD through Loader?
Which of the following are time operation types supported by Flink?
Which of the following statements about FusionInsight HBase visual modeling are correct?
In FusionInsight HD, which of the following is not a flow control feature of Hive?
From the point of view of the life cycle, what stages does data mainly go through?