Dataset file : google drive link
Hello Community , I need help regarding how to apply KNN clustering on this use case.
I have a dataset consisting (27884 ROWS, 8933 Columns)
Here's a little preview of a dataset
| user_iD | b1 | b2 | b3 | b4 | b5 | b6 | b7 | b8 | b9 | b10 | b11 |
|---|---|---|---|---|---|---|---|---|---|---|---|
| 1 | 1 | 7 | 2 | 3 | 8 | 0 | 4 | 0 | 6 | 0 | 5 |
| 2 | 7 | 8 | 1 | 2 | 4 | 6 | 5 | 9 | 10 | 3 | 0 |
| 3 | 0 | 0 | 0 | 0 | 1 | 5 | 2 | 3 | 4 | 0 | 6 |
| 4 | 1 | 7 | 2 | 3 | 8 | 0 | 5 | 6 | 0 | 4 | |
| 5 | 0 | 4 | 7 | 0 | 6 | 1 | 5 | 3 | 0 | 0 | 2 |
| 6 | 1 | 0 | 2 | 3 | 0 | 5 | 4 | 0 | 0 | 6 | 7 |
Here the column userid represents: STUDENTS and columns b1-b11: They represent Book Chapters and the sequence of each student that which chapter he/she studied first then second then third and so on. the 0 entry tells that the student did not study that particular chapter.
This is just a small preview of a big dataset. There are a total of 27884 users and 8932 Chapters stated as (b1--b8932)
I need to find a similar pattern and thus need to apply KNN clustering, how do I do that?
from Applying KNN Clustering based on user id
No comments:
Post a Comment