CBSE Class 10 Data Science 2023 Question Paper
2. Answer any 5 out of the given 6 questions.
(i) State whether the following statement is true or false : Subsetting helps to focus first on the required set of data.
(ii) Which of the following statement is true about median ?
- It is the smallest element of a sorted data set.
- It is the middle point of a sorted data set.
- It is the middle point of an unsorted data set.
- It is the smallest element of an unsorted data set.
(iii) Standard Deviation represents how much the data is spread out around the ______ or an average.
- Standard deviation
- Mean deviation
- Mean
- Median
(iv) Two-way relative frequency tables represent the percentage of _______ that fit in each category.
- preference
- data points
- mean values
- frequency
(v) __________ is an example of discrete data.
- Pass or Fail in a Math Test
- Length of a road
- Weather forecast
- Number of stars
Which of the following is not true for discarding the data ?
- This helps us to prevent unauthorized access to the data.
- It is important for all of us to make sure that we discard the digital data in a proper way.
- In most of the devices, if you do a soft delete of a particular file, this file deletes from the original space and cannot be restored.
- With the increased amount and intensity of cyber attacks, it is important for all of us to make sure that we discard the digital data in a proper way.
3. Answer any 5 out of the given 6 questions.
(i) Data Merging is the process of combining two or more data sets into a single _________ .
- Data Frame
- Data Designing
- Data Graphing
- Data Handling
(ii) __________ bias is an outcome of seeing what you want to see in the data.
- Linearity
- Selection
- Survivor
- Confirmation
(iii) __________ is all about counting randomness.
- Mean
- Median
- Probability
- Graphs
(iv) ______ is an example of real-life implementation of standard deviation.
- Grading tests
- Sum of values
- Sorting of data
- Spellcheck
(v) The private information that is shared should _________ be handled with confidentiality.
- never
- sometimes
- always
- often
(vi) There are two ways in which you can store the data in the __________ format or as a physical copy.
- analog
- printed
- hardcopy
- digital
4. Answer any 5 out of the given 6 questions.
(i) A percentile can be defined as the percentage of the total ordered observations at or _______ it.
- above
- equal
- below
- not at
(ii) The value of a z-score always tells us the number of standard deviations we are away from _________ .
- median
- mean
- standard deviation
- percentile
(iii) One-to-one join works by _________ the data tables using the Primary key.
- merging
- deleting
- inserting
- concatenating
(iv) Which of the following is true about distribution of an event ?
- It shows no values.
- It consists of only input values that can be seen.
- It consists of all possible values but not input values.
- It consists of both input values that can be seen and also all possible values.
(v) Which of the following is the last step of the Statistical Problem Solving Process ?
- Analyse the data
- Formulate statistical investigative questions
- Interpret the data
- Collect the data
(vi) Which of the following is used to find out the interquartile range ?
- Values of quartiles
- Values of percentiles
- Values of deciles
- Values of z-score
5. Answer any 5 out of the given 6 questions.
(i) The weight of students in a class has a mean of 22 with standard deviation of 4. What is the z-score for a student who weighs 30 kg ?
- 0
- 1
- 2
- 3
(ii) The median of the given dataset values 7, 12, 15, 16, 20 is
- 15
- 10
- 11
- 12
(iii) The mean absolute deviation for the data 3, 5, 9, 10, 15, 16 is
- 5
- 3
- 4
- 6
(iv) If two coins are tossed at the same time, what is the probability of getting Head-Tail ?
- 0.50
- 0.45
- 0.25
- 0.35
(v) Data collection designs must acknowledge _________ in data.
- Statistical Process Control
- Test Data
- Variability
- Experimental Designs
(vi) If the data that is fed into the system is __________, model accuracy and fidelity are compromised.
- biased
- predictive
- accurate
- continuous
Answer any 4 out of the given 6 questions in 20 - 30 words each.
11. What is Data-based subsetting ? Give an example.
12. What is Continuous data ? Give an example.
13. Why is Central Limit Theorem important ?
14. What is Many-to-Many Join ? Give an example to support your answer.
15. Name the two ways in which data can be stored. Give an example.
16. Define the term percentile. Give an example.
Answer any 3 out of the given 5 questions in 50 - 80 words each.
17. Give the steps to calculate Standard Deviation.
18. Explain the following components of the Statistical Problem-Solving Process with example :
(a) Formulate statistical investigative questions
(b) Collect/Consider the data
19. Differentiate between Recall Bias and Survivor Bias. Give an example of each.
20. What is the Interquartile Range ? Give an example to calculate an interquartile range.
21. Mention any two ways to discard the data stored in physical copy and any two ways to discard digital data.