Comments on: Understanding the Working of Universal Language Model Fine Tuning (ULMFiT) https://yashuseth.blog/2018/06/17/understanding-universal-language-model-fine-tuning-ulmfit/ A blog on data science, machine learning and artificial intelligence. Sun, 12 May 2019 06:29:50 +0000 hourly 1 http://wordpress.com/ By: NLP Metablog - A blog of blogs - ML-DL https://yashuseth.blog/2018/06/17/understanding-universal-language-model-fine-tuning-ulmfit/comment-page-1/#comment-14147 Sun, 12 May 2019 06:29:50 +0000 http://yashuseth.blog/?p=368#comment-14147 […] ULMFiT […]

Like

]]>
By: Yashu Seth https://yashuseth.blog/2018/06/17/understanding-universal-language-model-fine-tuning-ulmfit/comment-page-1/#comment-6165 Sat, 23 Feb 2019 14:24:11 +0000 http://yashuseth.blog/?p=368#comment-6165 In reply to kumar.

Yes, it can be used to classify any number of labels.

Like

]]>
By: kumar https://yashuseth.blog/2018/06/17/understanding-universal-language-model-fine-tuning-ulmfit/comment-page-1/#comment-5550 Tue, 19 Feb 2019 12:33:41 +0000 http://yashuseth.blog/?p=368#comment-5550 Thank you..is ULMFIT can be used to classify 4 labels

Like

]]>
By: Yashu Seth https://yashuseth.blog/2018/06/17/understanding-universal-language-model-fine-tuning-ulmfit/comment-page-1/#comment-5018 Sat, 16 Feb 2019 21:15:46 +0000 http://yashuseth.blog/?p=368#comment-5018 In reply to Ernest S Kirubakaran.

Let’s say we got h_1, h_2, h_3 ….. h_n hidden states from the last layer. These hidden states are combined as follows –
concat(max(h_1, h_2, …. h_n), mean(h_1, h_2, …. h_n), h_n)
Let’s call this vector h_final. h_final is now fed to a linear layer. The output of this linear layer (say, f1) is fed to another linear layer whose output dimension is 1 (in case of binary classification) and this gives the output (say, f2). f2 is a single real number. This f2 is fed to a sigmoid layer to get a number between 0 and 1.

Like

]]>
By: Yashu Seth https://yashuseth.blog/2018/06/17/understanding-universal-language-model-fine-tuning-ulmfit/comment-page-1/#comment-5015 Sat, 16 Feb 2019 21:03:30 +0000 http://yashuseth.blog/?p=368#comment-5015 In reply to kumarvrsec.

There are no pre-trained embeddings used. They are randomly initialized and learned during the language model pre-training step.

Like

]]>
By: kumarvrsec https://yashuseth.blog/2018/06/17/understanding-universal-language-model-fine-tuning-ulmfit/comment-page-1/#comment-4673 Fri, 15 Feb 2019 09:53:28 +0000 http://yashuseth.blog/?p=368#comment-4673 Can you explain what embeddings are used

Like

]]>
By: Ernest S Kirubakaran https://yashuseth.blog/2018/06/17/understanding-universal-language-model-fine-tuning-ulmfit/comment-page-1/#comment-3780 Mon, 11 Feb 2019 13:10:42 +0000 http://yashuseth.blog/?p=368#comment-3780 Can you please explain more about concat pooling in the final classification layer? If we get multiple hidden states each from bptt length of input sentence, how are we going to get a final number between 0 and 1? Any paper explaining this approach would be great. Thanks in advance.

Like

]]>
By: The year in AI/ML advances: 2018 roundup - AI+ NEWS https://yashuseth.blog/2018/06/17/understanding-universal-language-model-fine-tuning-ulmfit/comment-page-1/#comment-113 Sun, 23 Dec 2018 23:08:43 +0000 http://yashuseth.blog/?p=368#comment-113 […] by the idea of using language models, popularized this year by Fast.ai’s UMLFit (see also “Understanding UMLFit”). We have then seen other (and improved) approaches like Allen’s ELMO, Open AI’s […]

Like

]]>
By: A Walkthrough of InferSent – Supervised Learning of Sentence Embeddings – Let the Machines Learn https://yashuseth.blog/2018/06/17/understanding-universal-language-model-fine-tuning-ulmfit/comment-page-1/#comment-53 Sun, 05 Aug 2018 20:45:59 +0000 http://yashuseth.blog/?p=368#comment-53 […] If you are interested in transfer learning, you should check out the recent work of Jeremy Howard and Sebastian Ruder on ULMFiT (here and here). […]

Like

]]>
By: Links (18) | Nintil https://yashuseth.blog/2018/06/17/understanding-universal-language-model-fine-tuning-ulmfit/comment-page-1/#comment-49 Sun, 08 Jul 2018 15:02:05 +0000 http://yashuseth.blog/?p=368#comment-49 […] Understanding the working of Universal Language Model Finetuning […]

Like

]]>