Ya está disponible el modelo de lenguaje GPT-3 // GPT-3 language model is now available

Este 18 de noviembre de 2021 OpenAI ha abierto las puertas a todo el público de su modelo de lenguaje autorregresivo, GPT-3. Cuando hablamos de GPT-3 hablamos de un enorme modelo de inteligencia artificial (LLM – Large Languaje Model) capaz de generar lenguaje escrito a partir de una secuencia proporcionada. Este sistema se hizo muy popular cuando en el año 2020, OpenAI publicó sus resultados donde se mostraba el gran potencial y rendimiento que tenía este modelo para generar lenguaje natural (Probando GPT-3, Dot CSV).

GPT-3 es un modelo de lenguaje que principalmente ha sido entrenado con texto en inglés, pero esto no quita que su rendimiento sea mejor que en otros idiomas, ya que no se sabe si su capacidad de lenguaje trasciende al propio idioma, para luego comunicarse traduciendo en inglés. Hay que tener en cuenta que GPT3 no se basa en una realidad rigurosa. El texto generado solo busca que sea coherente en la forma, no en el fondo.

Desde el minuto cero, esta tecnología ha estado cerrada al público general, pero desde la semana pasada esto ha cambiado. Tras abrir el acceso de esta tecnología, cualquier persona puede utilizarla o incluso investigar cómo funciona. El acceso a GPT-3 es bastante sencillo, simplemente hay que registrarse en la (página web de OpenAI) y seleccionar el motivo por el cuál se quiere utilizar esta API. El uso de GPT3 no es gratuito, pero OpenAI proporciona un saldo inicial para todo usuario registrado que basta para estudiar el sistema.

En este punto, la forma más rápida de jugar con GPT-3 es a través del “Playground”, básicamente una ventana donde puedes escribir cualquier texto como input para que este modelo continúe con la generación.

A la derecha del “Playground” se pueden encontrar distintos parámetros para cambiar el comportamiento de GPT-3, como la temperatura (nos permite tener respuestas más creativas) o la longitud de la respuesta. De todos estos parámetros, el más importante es el “Engine”. GPT-3 no es una única inteligencia artificial, sino una familia de modelos con diferentes tamaños (cuanto mayor más potente pero más lento y caro). Los modelos “ada” serán más baratos y rápidos, destinados a resolver tareas sencillas. Lo importante de esta opción son los modelos “Base” y los modelos “Instruct”, pensados para plantearle al modelo instrucciones en lugar de ejemplos.

This November 18, 2021 OpenAI has opened the doors of its autoregressive language model, GPT-3, to the public. When we talk about GPT-3 we are talking about a huge artificial intelligence model (LLM – Large Language Model) capable of generating written language from a given sequence. This system became very popular when in 2020, OpenAI published its results showing the great potential and performance of this model to generate natural language (Testing GPT-3, Dot CSV).

GPT-3 is a language model that has mainly been trained with English text, but this does not detract from the fact that its performance is better than in other languages, since it is not known if its language capability transcends the language itself, to then communicate by translating into English. It should be noted that GPT3 is not based on a rigorous reality. The text generated is only intended to be coherent in form, not in substance. From minute zero, this technology has been closed to the general public, but since last week this has changed. After opening access to this technology, anyone can use it or even investigate how it works. Access to GPT-3 is quite simple, you just have to register on the (OpenAI website) and select the reason why you want to use this API. The use of GPT3 is not free, but OpenAI provides an initial balance for every registered user that is enough to study the system.

At this point, the fastest way to play with GPT-3 is through the «Playground«, basically a window where you can type any text as input for this model to continue with the generation.

On the right side of the «Playground» you can find different parameters to change the behavior of GPT-3, such as the temperature (allows us to have more creative responses) or the length of the response. Of all these parameters, the most important is the «Engine«. GPT-3 is not a single artificial intelligence, but a family of models with different sizes (the bigger the more powerful but slower and more expensive). The «ada» models will be cheaper and faster, intended to solve simple tasks. The important part of this option are the «Base» models and the «Instruct» models, designed to provide the model with instructions instead of examples.

Related Posts