Total Questions : 50
Expected Time : 50 Minutes

1. In data mining, what does the term 'supervised learning' refer to?

2. Which data mining technique is commonly used for finding associations and relationships among variables in large datasets?

3. What is the purpose of the 'apriori' algorithm in data mining?

4. What is the 'no free lunch' theorem, and how does it apply to data mining?

5. What does the term 'overfitting' refer to in the context of machine learning and data mining?

6. What is the primary goal of data mining in the field of knowledge discovery?

7. What is the significance of 'precision-recall tradeoff' in classification models?

8. Explain the difference between supervised and unsupervised learning in data mining.

9. How does 'overfitting' impact the performance of a data mining model?

10. How does 'overfitting' impact the performance of a machine learning model?

11. Which algorithm is commonly used for association rule mining?

12. How does 'principal component analysis' (PCA) contribute to dimensionality reduction in data mining?

13. How does the 'apriori' algorithm determine association rules?

14. In data mining, what is 'feature importance'?

15. Explain the concept of 'entropy' in decision tree algorithms.

16. What role does feature selection play in the data mining process?

17. What is the primary purpose of 'association rule mining' in data analysis?

18. What is the primary purpose of 'stratified sampling' in the context of data mining?

19. What role does 'gradient boosting' play in improving the performance of machine learning models?

20. In data mining, what does the term 'imbalanced dataset' refer to?

21. What is the 'apriori principle' in association rule mining?

22. What is the primary goal of data mining?

23. In the context of data mining, what is the purpose of 'cluster analysis'?

24. What is 'feature engineering' in the context of data mining?

25. Explain the concept of 'hyperparameter tuning' and its importance in machine learning.

26. How does the 'k-means' algorithm work in the context of clustering?

27. Which data mining technique is commonly used for predicting categorical outcomes?

28. In machine learning, what is the significance of the 'bias-variance tradeoff'?

29. How does the 'apriori' algorithm work in association rule mining?

30. What is the primary objective of cross-validation in data mining?

31. What is the 'curse of overfitting' in machine learning, and how does it relate to data mining?

32. How does the process of 'feature scaling' contribute to the effectiveness of certain machine learning algorithms?

33. What is the purpose of cross-validation in data mining?

34. In data mining, what does the term 'ensemble learning' refer to?

35. What is the primary goal of the 'k-nearest neighbors' (k-NN) algorithm in data mining?

36. What is the significance of 'confusion matrix' in evaluating the performance of a classification model?

37. What role does clustering play in data mining?

38. In data mining, what is 'ensemble learning' and how does it enhance predictive modeling?

39. Explain the concept of 'lift' in association rule mining.

40. How does the 'random forest' algorithm improve predictive performance in data mining?

41. In the context of classification, what does the term 'recall' measure?

42. How does the 'silhouette score' measure the quality of clustering?

43. Which data mining technique focuses on identifying patterns that describe the relationships between variables?

44. In data mining, what does the term 'anomaly detection' refer to?

45. What is the difference between classification and regression in data mining?

46. What is the purpose of the 'lift ratio' in association rule mining?

47. What is the primary objective of 'dimensionality reduction' techniques in machine learning?

48. Which term is commonly used to describe the process of finding hidden patterns or structures in data?

49. Which type of data mining task involves assigning predefined categories to items based on their characteristics?

50. How does the 'Naive Bayes' algorithm work in the context of classification?