Skynet Report

Adept AI has released Fuyu-8B, a smaller version of the multimodal model that powers the company’s product.

According to the company, the model is designed from the ground up for digital agents: it is easy to understand, scale and deploy, supports arbitrary image resolutions and performs fine-grained localisation on screen images.

The company adds that the model performs well on standard image-understanding benchmarks.

The company warns that faces and people are generally not generated properly and that the model should not be used to generate factual representations of people or events.