Business Intelligence and Big Data Analysis

Replenishment date: 14.04.2023
Content: Business-Intelligence-and-Big-Data-Analysis-_ANSWER_.doc (91.5 KB)
Seller:
alevtina_sar
Rating:
3,21
Description
1. An algorithm is:

*instruction to perform actions

* the process of performing calculations leading to the solution of a problem

* a system of rules that describes the sequence of actions that must be performed to solve the problem

2. A business process is:

* Interconnected activities that transform inputs into outputs

*A set of interrelated and interacting activities that transforms inputs into outputs that provide value to the client

*A collection of heterogeneous and significant activities that transforms inputs into outputs that can be useful to the client

*Many types of activities united by the production of one product, service

3. Most of the data mining methods were developed within the framework of…

*theories of artificial intelligence

*classic data analysis

*database theory

4. Which of the following cases are examples of structured data:

*Data on the sales of the company, presented in the form of reports in MS Excel

*Table with daily readings of room temperature for a year in CSV format

*Book text provided in PDF format

*Movies presented in mpeg format on one hard drive

5. Horizontal scalability in Big Data processing is:

*Expansion of the data processing mechanism with the growth of the volume of data

*Increase in processing speed with the growth of data volume

*Decrease in processing speed with an increase in data volume

*Scaling the presentation of data processing results

6. Decision trees belong to the group(s) of…

*statistical methods

*cybernetic methods

*logical methods

*cross-tab methods

7. The customer of a business process is an official who:

*has at their disposal the means to order the output of the business process

*has at their disposal the material and information resources of the business process, manages its course, and is responsible for its result and efficiency

*has at their disposal the resources and authority to make decisions on describing, regulating, or auditing the business process

*has at their disposal the necessary tools for designing and managing the business process

8. The main characteristics of Big Data include:

*Virtualization, Volume, Variability, Vehicle

*Variety, Velocity, Volume, Value

*Verification, Volume, Velocity, Visualization

*Video, Value, Variety, Volume

9. How are missing values denoted in R?

10. How to get help in R:

*In RStudio, you can put the cursor on the function name and press F1

*you can type a question mark before the function name

*you can use the help() function

11. What is the name of the "boolean" data type in R?

12. What is the name of the "string" data type in R?

13. What is the name of the "integer" data type in R?

14. What is the name of the data type "floating point numbers" in R?

15. Which dplyr function is used to join tables vertically?

*bind()

*bind_rows()

*left_join()

*union()

*bind_cols()

*join()

16. What loops are available in the basic R syntax?

*For

*Which

*Repeat

*While

*Next

*goto

17. How can a value be assigned to the variable "a" in the R language:

* a =

*a<-

* a >-

*a !=

18. Big Data locality is:

*Expansion of the data processing mechanism with the growth of the volume of data

*Processing and storage takes place on the same machine

*Communication time cannot be higher than processing time

*Data should not be processed on the storage server

19. The median for sample 1,__,3,7,10,15,16,18 is:

*7.714286

*7

*8.5

*Median cannot be calculated due to missing values
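
The role of the blank entry in question 19 can be explored with a short sketch. It is written in Python rather than R, `median_skipping_missing` is a hypothetical helper, and `None` stands in for the missing value. It only shows what happens when the missing entry is dropped first; in R itself, `median()` returns `NA` for input that contains `NA` unless `na.rm = TRUE` is passed.

```python
from statistics import median


def median_skipping_missing(values):
    """Return the median of the entries that are present (not None)."""
    present = [v for v in values if v is not None]
    if not present:
        raise ValueError("no values present")
    return median(present)


# The sample from the question, with None as an assumed marker for the blank.
sample = [1, None, 3, 7, 10, 15, 16, 18]

# Seven values remain after dropping the blank, so the middle one is returned.
print(median_skipping_missing(sample))  # prints 10
```

Whether dropping the missing entry is acceptable is a modeling decision; the sketch does not settle which quiz option is intended.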

20. At what stage of the data life cycle according to the CRISP-DM methodology does hypothesis testing take place?

*Business understanding

*Data Understanding

*Modeling

*Evaluation

21. Name the difficulties of hierarchical clustering methods:

*Limit on the size of the data set

*Choice of proximity measure

*Inflexibility of the resulting classifications

*Presence of assumptions regarding the number of clusters

22. It's not true that den
Additional Information
27. The first stage of the data life cycle in accordance with the CRISP-DM methodology:

*Modeling

* Implementation (Deployment)

*Data Preparation

*Business understanding

28. Marketing processes belong to the group:

*Management processes

*Supporting processes

*Operating processes

29. Recruitment processes are classified as:

*Management processes

*Supporting processes

*Operating processes

30. Solving the problem of forecasting ...

*possible without training sample

*requires some training set of data

*is a solution to the problem of "learning without a teacher" (unsupervised learning)

31. How many terabytes are in 1 zettabyte?

*1.073742∙10^9

*2.147484∙10^9

*1.888947∙10^7

* 1024
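
The conversion in question 31 is plain arithmetic on powers, and a minimal sketch makes it checkable. The constant names are illustrative, and the choice between binary prefixes (powers of 1024) and decimal SI prefixes (powers of 1000) is an assumption the question does not state, so both are shown.

```python
# Binary convention (assumption): 1 TB = 2**40 bytes, 1 ZB = 2**70 bytes.
BYTES_PER_TERABYTE_BINARY = 2 ** 40
BYTES_PER_ZETTABYTE_BINARY = 2 ** 70

# Decimal SI convention (assumption): 1 TB = 10**12 bytes, 1 ZB = 10**21 bytes.
BYTES_PER_TERABYTE_DECIMAL = 10 ** 12
BYTES_PER_ZETTABYTE_DECIMAL = 10 ** 21

terabytes_per_zettabyte_binary = BYTES_PER_ZETTABYTE_BINARY // BYTES_PER_TERABYTE_BINARY
terabytes_per_zettabyte_decimal = BYTES_PER_ZETTABYTE_DECIMAL // BYTES_PER_TERABYTE_DECIMAL

print(terabytes_per_zettabyte_binary)   # prints 1073741824 (2**30)
print(terabytes_per_zettabyte_decimal)  # prints 1000000000 (10**9)
```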

32. Web mining technology uses Data Mining technology to analyze:

*unstructured information

*structured information

* heterogeneous information

*homogeneous information

*distributed information of significant volume

*information contained on Web sites

33. Establish a correspondence between the algorithms for combining two clusters and their characteristics:

A. Farthest neighbor method

B. Average linkage method

C. Median linkage method

D. The degree of proximity is estimated by the degree of proximity between the most distant objects of the clusters

E. The degree of proximity is estimated as the average value of the degrees of proximity between cluster objects

F. The distance between any cluster S and the new cluster resulting from the union of clusters P and Q is defined as the distance from the center of cluster S to the middle of the segment connecting the centers of clusters P and Q
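
The three characteristics in question 33 can be written as small functions. This is a sketch under stated assumptions: clusters are lists of 2-D points, "degree of proximity" is taken to be Euclidean distance, "center" is taken to be the centroid, and all function names are hypothetical.

```python
from itertools import product
from math import dist


def farthest_pair_distance(p, q):
    """Distance between the two most distant objects of clusters p and q."""
    return max(dist(a, b) for a, b in product(p, q))


def average_pair_distance(p, q):
    """Mean of the distances over all pairs (one object from p, one from q)."""
    distances = [dist(a, b) for a, b in product(p, q)]
    return sum(distances) / len(distances)


def centroid(cluster):
    """Coordinate-wise mean of the points in a cluster."""
    return tuple(sum(coords) / len(cluster) for coords in zip(*cluster))


def distance_to_midpoint_of_centers(s, p, q):
    """Distance from the center of s to the midpoint between the centers of p and q."""
    cp, cq = centroid(p), centroid(q)
    midpoint = tuple((x + y) / 2 for x, y in zip(cp, cq))
    return dist(centroid(s), midpoint)


# Illustrative clusters (made-up points).
P = [(0, 0), (0, 3)]
Q = [(4, 0), (4, 3)]
S = [(2, 4.5)]

print(farthest_pair_distance(P, Q))              # prints 5.0
print(average_pair_distance(P, Q))               # prints 4.5
print(distance_to_midpoint_of_centers(S, P, Q))  # prints 3.0
```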

34. Match the most common data processing approaches with their characteristics:

A. SQL

B. MapReduce

C. SAP HANA

D. Structured query language that allows you to work with databases. With its help, you can create and modify data, and the corresponding database management system (DBMS) is responsible for managing the data array.

E. Distributed computing model. Used for parallel computing on very large datasets (petabytes or more). In its programming interface, data is not transferred to the program for processing; instead, the program is transferred to the data, so each query is a separate program. The principle of operation is to process the data sequentially in two steps

F. High performance data storage and processing platform. Provides high speed request processing. Another hallmark is that this platform simplifies the system landscape, reducing the cost of supporting analytical systems.
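
The two-step processing model described in question 34 can be illustrated with a single-process word count. This is only a sketch of the idea, not a distributed implementation: the names `map_phase`, `shuffle`, `reduce_phase`, and `word_count` are hypothetical, and a real framework would run the map and reduce steps on many machines close to where the data is stored.

```python
from collections import defaultdict


def map_phase(document):
    """Map step: emit a (word, 1) pair for every word in one document."""
    for word in document.split():
        yield word.lower(), 1


def shuffle(pairs):
    """Group the emitted values by key, as happens between the two steps."""
    grouped = defaultdict(list)
    for key, value in pairs:
        grouped[key].append(value)
    return grouped


def reduce_phase(key, values):
    """Reduce step: combine all values emitted for one key."""
    return key, sum(values)


def word_count(documents):
    """Run the map step on every document, then reduce each key's values."""
    pairs = (pair for doc in documents for pair in map_phase(doc))
    return dict(reduce_phase(k, v) for k, v in shuffle(pairs).items())


print(word_count(["Big Data", "big analysis"]))
# prints {'big': 2, 'data': 1, 'analysis': 1}
```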

35. Establish a correspondence between neural network training methods and their characteristics:

A. Unsupervised learning

B. Supervised learning

C. Reinforcement learning

D. The model uses unlabeled data, from which the algorithm itself tries to extract features and dependencies

E. The neural network is trained on the labeled dataset and predicts the responses that are used to evaluate the accuracy of the algorithm on the training data

F. The system learns by interacting with the environment, not by historical data

36. What is Business Intelligence (BI):

*Synonymous with business analysis

*Technologies and software for converting large volumes of raw information into data necessary for making management decisions

*Competitive intelligence system - collection, processing and analysis of information from various sources in order to substantiate management decisions that improve the competitiveness of the business

37. The main measures of distance between objects when using the hierarchical cluster analysis (CA) method:

*Euclidean distance

*squared Euclidean distance

*Manhattan distance

*Chebyshev distance
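
For reference, the four distance measures named in question 37 can be written out directly. This is a minimal sketch for points given as equal-length sequences of numbers; the function names are illustrative.

```python
from math import sqrt


def squared_euclidean(a, b):
    """Sum of squared coordinate differences."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def euclidean(a, b):
    """Square root of the squared Euclidean distance."""
    return sqrt(squared_euclidean(a, b))


def manhattan(a, b):
    """Sum of absolute coordinate differences (city-block distance)."""
    return sum(abs(x - y) for x, y in zip(a, b))


def chebyshev(a, b):
    """Largest absolute coordinate difference."""
    return max(abs(x - y) for x, y in zip(a, b))


a, b = (0, 0), (3, 4)
print(euclidean(a, b))          # prints 5.0
print(squared_euclidean(a, b))  # prints 25
print(manhattan(a, b))          # prints 7
print(chebyshev(a, b))          # prints 4
```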

38. When using which method is it necessary to set the number of clusters?

*k-means method

*nearest neighbor method

*the whole group of hierarchical methods

*all answers are incorrect
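
To show where the number of clusters enters a k-means computation, here is a minimal one-dimensional sketch. It makes simplifying assumptions: the first `k` points serve as initial centers, the loop runs a fixed number of iterations instead of testing convergence, and `k_means_1d` is a hypothetical name. The parameter `k` has to be supplied by the caller before anything else can run.

```python
def k_means_1d(points, k, iterations=10):
    """Naive k-means on numbers; k must be chosen by the caller in advance."""
    if not 1 <= k <= len(points):
        raise ValueError("k must be between 1 and the number of points")
    centers = list(points[:k])  # assumption: first k points as initial centers
    clusters = [[] for _ in range(k)]
    for _ in range(iterations):
        # Assignment step: attach every point to its nearest center.
        clusters = [[] for _ in range(k)]
        for p in points:
            nearest = min(range(k), key=lambda i: abs(p - centers[i]))
            clusters[nearest].append(p)
        # Update step: move each center to the mean of its cluster.
        centers = [
            sum(c) / len(c) if c else centers[i]
            for i, c in enumerate(clusters)
        ]
    return centers, clusters


centers, clusters = k_means_1d([1.0, 2.0, 3.0, 10.0, 11.0, 12.0], k=2)
print(centers)   # prints [2.0, 11.0]
print(clusters)  # prints [[1.0, 2.0, 3.0], [10.0, 11.0, 12.0]]
```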

39. It is not true that the method recommended for small selections
Similar items
Big Data Technologies test with answers
Seller:
alevtina_sar
Rating:
3,21
Sales:
0
price:
3,27 $
Web Analytics
Seller:
alevtina_sar
Rating:
3,21
Sales:
0
price:
2,72 $
Web analytics
Seller:
alevtina_sar
Rating:
3,21
Sales:
0
price:
3,27 $
Web analytics courses
Seller:
Ramzes666_
Rating:
0
Sales:
0
price:
10,89 $