Телеграмм чат группы natural_language

So, you have output from last layer of encoder, lets name it R_out. In decoder you have two types of attentions, as i said before. Kencdec and Vencdec are used only in Encoder Decoder attention. In this layer you calculate K,Q,V as K = R_out * W_k (this is Kencdec), V = R_out*W_v(this is Vencdec) and Q = X * W_q (X - embedding of target sequence).

источник

16:23пожаловаться #4

M

Manoj in Natural Language Processing

Ivan Dolgov

So, you have output from last layer of encoder, lets name it R_out. In decoder you have two types of attentions, as i said before. Kencdec and Vencdec are used only in Encoder Decoder attention. In this layer you calculate K,Q,V as K = R_out * W_k (this is Kencdec), V = R_out*W_v(this is Vencdec) and Q = X * W_q (X - embedding of target sequence).

Ohk. Thanks... I thought these K and V are something different

источник

16:30пожаловаться #5

M

Manoj in Natural Language Processing

I was confused between K and Kencdec... I thought both of these are different.

источник

16:32пожаловаться #6

M

Manoj in Natural Language Processing

But these are same

источник

16:32пожаловаться #7

M

Manoj in Natural Language Processing

Kencdec= K from last encoder

источник

16:33пожаловаться #8

ID

Ivan Dolgov in Natural Language Processing

Why? Kencdec is calculated in Decoder part, there are his own weights in W_k matrix, which transform output from encoder.

источник

16:42пожаловаться #9

M

Manoj in Natural Language Processing

источник

17:15пожаловаться #10

M

Manoj in Natural Language Processing

Then let me properly research...becoz in the blog it's written that the outputs of top encoder are transformed into K and V

источник

17:16пожаловаться #11

M

Manoj in Natural Language Processing

Link to blog:
http://jalammar.github.io/illustrated-transformer/

jalammar.github.io

The Illustrated Transformer

Discussions:
Hacker News (65 points, 4 comments), Reddit r/MachineLearning (29 points, 3 comments)

Translations: Chinese (Simplified), Japanese, Korean, Russian

Watch: MIT’s Deep Learning State of the Art lecture referencing this post

In the previous post, we looked at Attention – a ubiquitous method in modern deep learning models. Attention is a concept that helped improve the performance of neural machine translation applications. In this post, we will look at The Transformer – a model that uses attention to boost the speed with which these models can be trained. The Transformers outperforms the Google Neural Machine Translation model in specific tasks. The biggest benefit, however, comes from how The Transformer lends itself to parallelization. It is in fact Google Cloud’s recommendation to use The Transformer as a reference model to use their Cloud TPU offering. So let’s try to break the model apart and look at how it functions.

The Transformer was proposed in the paper Attention is All You Need. A TensorFlow…

источник

17:16пожаловаться #12

KD

Kabyken Daulet in Natural Language Processing

Ребята, привет.
Подскажите, плз, какими способами из .ann+.txt файла можно получить conll с IOB разметкой?
Гуглил, что-то не нашел толком решения.
Заранее благодарю!

источник

19:15пожаловаться #13

R

Rishi in Natural Language Processing

Valentin Malykh

in fact most of the topic models assign all the topics to a document (with different weights, of course)

Thank you.. Your inputs steered me in the correct direction.

источник

19:40пожаловаться #14

EM

Eugene Molodkin in Natural Language Processing

Kabyken Daulet

Ребята, привет.
Подскажите, плз, какими способами из .ann+.txt файла можно получить conll с IOB разметкой?
Гуглил, что-то не нашел толком решения.
Заранее благодарю!

я не помню из какого формата - кажется из WebAnno TSV - форматировал сам с помощью питона и NLTK - там есть кое-какой тулинг для Conll IOB

источник

19:52пожаловаться #15

EM

Eugene Molodkin in Natural Language Processing

существующих инструментов на тот момент для конвертации не нашел рабочих (года 3 назад)

источник

19:52пожаловаться #16

EM

Eugene Molodkin in Natural Language Processing

https://corpus-tools.org/pepper/ вот эта штука выглядела многообещающе, но не получилось завести, может сейчас лучше стало

corpus-tools.org

Pepper (corpus-tools.org)

Pepper is an swiss-army knife to convert corpora from one linguistic format to another. It used to be called SaltNPepper but it's now only known simply as Pepper

источник

19:54пожаловаться #17

KD

Kabyken Daulet in Natural Language Processing

Eugene Molodkin

я не помню из какого формата - кажется из WebAnno TSV - форматировал сам с помощью питона и NLTK - там есть кое-какой тулинг для Conll IOB

спасибо, попробую!

источник

20:33пожаловаться #18

РН

Роман Некрасов... in Natural Language Processing

коллеги, из библиотеки tensorflow-text кто-нибудь тестировал токенизаторы и прочий функционал? есть что-то важное для обработки русскоязычных текстов?

источник

20:56пожаловаться #19

2020 June 21

Ю

Юра Незнанов... in Natural Language Processing

Роман Некрасов

коллеги, из библиотеки tensorflow-text кто-нибудь тестировал токенизаторы и прочий функционал? есть что-то важное для обработки русскоязычных текстов?

лучше керас. мне кажется проще использовать его

источник

05:20пожаловаться #20