Friday, 11 March 2022

Applying KNN Clustering based on user id

Dataset file : google drive link

Hello Community , I need help regarding how to apply KNN clustering on this use case.

I have a dataset consisting (27884 ROWS, 8933 Columns)

Here's a little preview of a dataset

user_iD b1 b2 b3 b4 b5 b6 b7 b8 b9 b10 b11
1 1 7 2 3 8 0 4 0 6 0 5
2 7 8 1 2 4 6 5 9 10 3 0
3 0 0 0 0 1 5 2 3 4 0 6
4 1 7 2 3 8 0 5 6 0 4
5 0 4 7 0 6 1 5 3 0 0 2
6 1 0 2 3 0 5 4 0 0 6 7

Here the column userid represents: STUDENTS and columns b1-b11: They represent Book Chapters and the sequence of each student that which chapter he/she studied first then second then third and so on. the 0 entry tells that the student did not study that particular chapter.

This is just a small preview of a big dataset. There are a total of 27884 users and 8932 Chapters stated as (b1--b8932)

I need to find a similar pattern and thus need to apply KNN clustering, how do I do that?



from Applying KNN Clustering based on user id

No comments:

Post a Comment