We tried out Pixtral, the new Mistral AI model and it's impressive.

Pixtral model is available online at Le Chat

© Mistral Amnesty International

Mistral AI announces that Le Chat, its AI-powered chatbot, is now available to everyone. While it used to take nearly 24GB of data to download in a torrent file to use Pixtral, the company has just bundled it into its web service.

Advertisement, Your content continues below

Mistral AI's first multi-media AI in web version

Similar to ChatGPT or Google Gemini, Le Chat is available in a web version. Until now, it allowed dialogue with the Mistral Nemo, Codestral and Mistral Large 2 AI language models, and this new version gives free access to Pixtral 12-B, Mistral's first multimodal AI model.

Mistral cat models

AI models are available on Le Chat

© Screenshot // Mistral

In other words, a multimodal language includes its ability to process different data formats: for this language, it is the ability to analyze texts and images.

pixelal standards

Bextral 12-B Standards

© Mistral Amnesty International

According to benchmarks published by Mistral AI, the startup is happy to match, and sometimes even outperform, some larger models, such as the LLaVA-OV 7B.

Pixtral Test: Generate HTML Code from a Diagram

We wanted to check out the capabilities of Pixtral 12-B highlighted by Mistral AI. The company claims that its language can generate computer code from a hand-drawn sketch. So we drew a web page on an iPad using the Procreate app using an Apple Pencil.

Mistral AI 12B Pixel Test Code

Raised drawing of a web page

© Florent Lane for Les Numériques

When we submitted this image to Pixtral, we included this prompt: “Write HTML code to create a site like this.” The chat is triggered and the source code is generated in HTML format.

HTML pixel 12b

Pixtral 12B generates HTML code from image

© Florent Lane for Les Numériques

It's strange that we were so quick to display the HTML code in the browser version.

Mistral AI Pixtral HTML

HTML web page created by Pixtral 12-B with Le Chat

© Florent Lane for Les Numériques

Although the result may seem a bit superficial, the optical handwriting recognition is very effective. The layout is generally respected, except for the news sites which are not laid out as shown in the diagram.

Advertisement, Your content continues below

See also  Watch out, the new world beta literally burned GeForce 3080 and 3090

Stan Shaw

<p class="sign">"Professional food nerd. Internet scholar. Typical bacon buff. Passionate creator."</p>

Leave a Reply

Your email address will not be published. Required fields are marked *

Back to top