Saltar navegación

Activa JavaScript para disfrutar de los vídeos de la Mediateca.

Curso IA programamos.es -- cómo entienden el texto los ordenadores_eng - Contenido educativo

Ajuste de pantalla

El ajuste de pantalla se aprecia al ver el vídeo en pantalla completa. Elige la presentación que más te guste:

Subido el 29 de marzo de 2024 por David G.

18 visualizaciones

Descargar la transcripción

As we already know, computers can only work with numbers, and yet, there are computer systems that are capable of understanding our texts. 00:00:00
How does this happen? 00:00:08
What is used is a mechanism to translate words or phrases into a numerical representation known as embeddings. 00:00:10
As Romea Jeremy Howard says in his book "AI Applications Without a PhD", 00:00:17
the artificial intelligence community sometimes likes to use rather pompous names for concepts that are actually very simple. 00:00:22
And with embeddings, this is somewhat the case. Let's see how they are built. 00:00:30
Imagine we are in a situation where a numerical representation has already been assigned to a set of words, using two numbers. 00:00:34
Where would we place the word "apple"? 00:00:42
Near position A there are several round objects. Near position B there are words related to constructions. 00:00:45
But at position C, we would have the word "apple" close to others related to fruits. 00:00:53
This would be a good location, since the goal of embeddings is for similar words 00:00:58
to correspond to nearby points, and words that are different to correspond 00:01:02
to distant points. 00:01:07
Let's see another example. 00:01:09
Suppose we have already assigned a numerical representation to the words "dog", "puppy", and 00:01:11
"calf". 00:01:17
Where would we place the word "cow"? 00:01:18
All three positions could make sense, but if we place it at position 00:01:20
C, we would be capturing some relationships between the words, which is precisely another 00:01:25
goal of embeddings. In this case, we would be capturing two analogies. 00:01:30
On one hand, "puppy" is to "dog" as "calf" is to "cow". And on the other, "puppy" is to "calf", 00:01:36
as "dog" is to "cow". Thus, this embedding would be capturing two properties of the 00:01:42
words, age and size. And basically, these are embeddings. What happens is that the 00:01:49
ones we use in real applications have hundreds or thousands of dimensions, meaning 00:01:54
that a word is translated into a vector of hundreds or thousands of numbers. 00:01:58
As detailed in the article associated with this video, these embeddings allow 00:02:04
performing visualizations and classroom activities that are very interesting and that 00:02:08
could be the 21st-century equivalent of learning to explore a dictionary. 00:02:13
But these word embeddings have certain limitations when it comes to 00:02:19
recognizing phrases, since the same word can mean different things depending on the 00:02:22
context. Fortunately, since transformers were born with their attention mechanism that 00:02:28
allows understanding the context, we also have embeddings that are capable of assigning a 00:02:34
numerical representation to complete phrases coherently. Thus, we can see that the phrase 00:02:39
"I like basketball more than anything" is semantically closer to "I love basketball" 00:02:45
than the phrase "I love football", even though these last two share more words 00:02:51
in common. And there are even multilingual phrase embeddings where phrases that 00:02:56
mean the same thing in different languages ​​receive a close numerical representation. 00:03:02
As we will see in future installments, these word and phrase embeddings are the basis 00:03:07
of large language models like GPT-3 and Bloom. But until we get there, don't 00:03:12
stop playing with the challenges and tasks we propose on our website, as they 00:03:17
will allow you to interact directly with the internal workings of many 00:03:22
of the artificial intelligence systems we use daily. 00:03:26
Idioma/s:
es
Idioma/s subtítulos:
en
Autor/es:
Programamos.es
Subido por:
David G.
Licencia:
Reconocimiento
Visualizaciones:
18
Fecha:
29 de marzo de 2024 - 23:08
Visibilidad:
Público
Centro:
IES MARIE CURIE Loeches
Duración:
03′ 30″
Relación de aspecto:
1.78:1
Resolución:
854x480 píxeles
Tamaño:
17.73 MBytes

Del mismo autor…

Ver más del mismo autor


EducaMadrid, Plataforma Educativa de la Comunidad de Madrid

Plataforma Educativa EducaMadrid