Hemant Vishwakarma: Loss function for comparing two vectors for categorization

Monday, 24 May 2021

I am working on an NLP task in which I analyze a document and classify it into one of six categories. I do this at three different time periods, so the final output is an array of three integers (sparse labels, not one-hot), each a category from 0 to 5. A label looks like this: [1, 4, 5].

I am using BERT and am trying to decide what type of head to attach to it, as well as what loss function to use. Would it make sense to take BERT's output of size 1024, run it through a Dense layer with 18 neurons, and then reshape the result to (3, 6), so that each of the three rows holds the six class scores for one time period?
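To make the shapes concrete, here is a minimal plain-Python sketch of that head, not a definitive implementation. It assumes the 18 Dense outputs are raw logits laid out period by period; the helper names (`reshape_logits`, `softmax`) and the stand-in logit values are made up for illustration. In Keras the same idea would presumably be a `Dense(18)` layer followed by `Reshape((3, 6))`.

```python
import math

NUM_PERIODS = 3   # three time periods
NUM_CLASSES = 6   # categories 0-5

def reshape_logits(flat, rows=NUM_PERIODS, cols=NUM_CLASSES):
    """Split a flat list of rows*cols logits into one row per time period."""
    if len(flat) != rows * cols:
        raise ValueError(f"expected {rows * cols} values, got {len(flat)}")
    return [flat[r * cols:(r + 1) * cols] for r in range(rows)]

def softmax(row):
    """Numerically stable softmax over one row of logits."""
    m = max(row)
    exps = [math.exp(x - m) for x in row]
    total = sum(exps)
    return [e / total for e in exps]

# Stand-in for the 18 outputs of the Dense layer (hypothetical values).
flat_logits = [float(i % 7) for i in range(NUM_PERIODS * NUM_CLASSES)]

logits = reshape_logits(flat_logits)            # shape (3, 6)
probs = [softmax(row) for row in logits]        # softmax per period, not over all 18
predicted = [max(range(NUM_CLASSES), key=row.__getitem__) for row in probs]
print(predicted)
```

The point of the reshape is that the softmax is applied to each row of six separately, so every time period gets its own distribution over the six categories.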

Finally, I assume I would use Sparse Categorical Cross-Entropy as my loss function?
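As a sanity check on that choice, the sketch below computes sparse categorical cross-entropy by hand for a (3, 6) array of logits and an integer label such as [1, 4, 5], averaging over the three periods. It assumes the head emits raw logits rather than probabilities; the function name and the example logits are illustrative only. With Keras, `SparseCategoricalCrossentropy(from_logits=True)` applied to the (3, 6) output should play the same role.

```python
import math

def sparse_categorical_cross_entropy(logits, labels):
    """Mean over rows of -log softmax(row)[label], computed from raw logits."""
    losses = []
    for row, label in zip(logits, labels):
        m = max(row)
        # log-sum-exp trick for numerical stability
        log_sum_exp = m + math.log(sum(math.exp(x - m) for x in row))
        losses.append(log_sum_exp - row[label])
    return sum(losses) / len(losses)

label = [1, 4, 5]  # one integer class per time period

# All-zero logits give a uniform prediction, so the loss is log(6).
uniform = [[0.0] * 6 for _ in range(3)]
loss_uniform = sparse_categorical_cross_entropy(uniform, label)

# Logits that strongly favor the correct class give a loss near zero.
confident = [[10.0 if c == y else 0.0 for c in range(6)] for y in label]
loss_confident = sparse_categorical_cross_entropy(confident, label)

print(loss_uniform, loss_confident)
```

Because the labels stay as integers, no one-hot encoding to shape (3, 6) is needed, which is the difference between the sparse and the plain categorical variants.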
