Data mining

“Please be sure to review Chapter #2 and Chapter #3. After reviewing the content, please answer the following 5 questions.Please be sure to provide adequate support for your answer.
Please be sure to list at least 2 references in APA format.
Going outside the classroom and researching can enhance the learning.
1. Can you think of a situation in which identification numbers would be useful for prediction? (15 pts)

2. Give at least two advantages to working with data stored in text files instead of in a binary format (15 pts)
3. Identify at least two advantages and two disadvantages of using color to visually represent information. (15 pts)

4. Discuss the advantages and disadvantages of using sampling to reduce the number of data objects that need to be displayed. Would simple random sampling (without replacement) be a good approach to sampling? Why or why not? (15 pts)

5. Describe one advantage and one disadvantage of a stem and leaf plot with respect to a standard histogram. (15 pts)