The Transformer Algorithm with the Lowest Optimal Time Complexity Possible
Prologue
We deliberately discuss only time complexity in this article.
For space complexity, refer to my earlier article on 1-bit transformers.
Introduction
Generative AI technology is racing forward, and the algorithms behind large language models are no exception. In this article, we cover three of the most exciting recent developments in generative AI and discuss each in detail. One of them has achieved the optimal asymptotic time complexity for running a large language model. In other words, under our current models of computation, no transformer algorithm can be asymptotically faster; the only remaining improvements are constant-factor optimizations. And because we are dealing with hundreds of billions of parameters, those constant-factor speedups can be substantial! I hope you are as excited as I am, because this promises to be quite a ride!
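To make the constant-factor point concrete, here is a minimal back-of-the-envelope sketch. All numbers in it (the parameter count, the assumed hardware throughput, and the roughly 2N FLOPs-per-token rule of thumb for a dense transformer forward pass) are illustrative assumptions, not figures from the papers covered below:

```python
# Back-of-the-envelope: why constant factors matter at LLM scale.
# Assumption (a common rule of thumb): a dense transformer's forward
# pass costs roughly 2 * N FLOPs per generated token, where N is the
# parameter count. All numbers here are illustrative, not measured.

N = 175e9                  # hypothetical parameter count (~175B)
flops_per_token = 2 * N    # approximate forward-pass cost per token

hardware_flops = 300e12    # assumed sustained throughput: 300 TFLOP/s

tokens_per_sec = hardware_flops / flops_per_token
print(f"baseline:        {tokens_per_sec:,.0f} tokens/sec")

# A constant-factor optimization (say, a kernel that halves the cost
# of the whole forward pass) leaves the asymptotic complexity class
# unchanged but doubles real-world throughput:
speedup = 2.0
print(f"2x constant win: {tokens_per_sec * speedup:,.0f} tokens/sec")
```

Even with the asymptotic class fixed, a 2x constant-factor win doubles tokens per second, which at serving scale can mean running half as much hardware for the same traffic.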