Cost-Per-Byte Principle in Generative AI

Xiaoyi Li

Cost-Per-Byte Principle in Generative AI

Generative AI models are increasingly used across various modalities, including text, images, audio, and video. Estimating the computational cost of generating con- tent is crucial for optimizing performance and resource allocation. This paper intro- duces the Cost-Per-Byte Principle: C = T × I, a universal law that relates the cost of content generation to per-byte generation time and per-second inference cost. We derive the per-byte generation time analytically based on the model’s computational requirements (FLOPs) and the hardware’s performance (FLOPs per second). By estab- lishing mappings between bytes and different content units (characters, pixels, samples, frames), we provide a modality-agnostic framework for cost estimation. We present a rigorous proof of the principle’s validity and apply it to estimate the costs of current popular models, using publicly available evidence to verify the accuracy and usefulness of this principle.

Comments: 10 Pages.

Download: PDF

Submission history

[v1] 2024-11-13 22:17:11

Unique-IP document downloads: 382 times

Vixra.org is a pre-print repository rather than a journal. Articles hosted may not yet have been verified by peer-review and should be treated as preliminary. In particular, anything that appears to include financial or legal advice or proposed medical treatments should be treated with due caution. Vixra.org will not be responsible for any consequences of actions that result from any form of use of any documents on this website.

Add your own feedback and questions here:
You are equally welcome to be positive or negative about any paper but please be polite. If you are being critical you must mention at least one specific error, otherwise your comment will be deleted as unhelpful.

Artificial Intelligence

Cost-Per-Byte Principle in Generative AI

Submission history