2024-06-04

Parallelizing the un-parallelizable

At the recent ICLR conference, we showed how it is possible to parallelize Recurrent Neural Networks.

We achieved a feat that many people think cannot be done: parallelizing Recurrent Neural Networks (RNNs) over the time axis, achieving more than 100x speed-ups. This is what our Chief Scientific Officer presented at the Twelfth International Conference on Learning Representations (ICLR) in May.

RNNs are a very common class of neural network architectures for processing sequential or time-series data. They have been successfully applied in many fields, including natural language processing, finance, neuroscience, and medical science.

Despite their wide applicability, RNNs are known to be slow to train because of their sequential nature. For example, to process a sequence of length 1,000, an RNN must run a sequential loop 1,000 times. Think of it as reading a book: you must read each page before moving on to the next for the story to make sense. This sequential dependency cannot be exploited efficiently by modern deep learning hardware, such as GPUs, which excel at parallel computation. The resulting slow training is also a main reason why RNNs are less commonly used than Transformers for building large language models.
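To make the sequential bottleneck concrete, here is a minimal sketch of evaluating a simple Elman-style RNN one step at a time in JAX. The cell definition and function names are illustrative assumptions, not the exact architecture from the paper; the point is that each hidden state depends on the previous one, so the T steps must run one after another no matter how many GPU cores are available.

```python
import jax
import jax.numpy as jnp

def rnn_cell(state, x, W_h, W_x, b):
    # One step of a simple Elman-style RNN: the new hidden state depends
    # on the previous hidden state, so steps cannot be reordered.
    return jnp.tanh(W_h @ state + W_x @ x + b)

def run_rnn_sequential(inputs, state0, W_h, W_x, b):
    # inputs: (T, d_in). The scan below visits t = 1 .. T strictly in order,
    # which is the bottleneck for long sequences.
    def step(state, x):
        new_state = rnn_cell(state, x, W_h, W_x, b)
        return new_state, new_state

    _, states = jax.lax.scan(step, state0, inputs)
    return states  # (T, d_hidden)
```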

In the work that we presented at ICLR 2024, we show that it is possible to parallelize RNNs, achieving orders-of-magnitude speed-ups over the traditional sequential method on a GPU. The idea is based on a method that has existed for hundreds of years: Newton's method. We translated Newton's method to the problem of evaluating RNNs and developed a new algorithm that parallelizes them over the time axis. The result is that we can evaluate an RNN more than 100x faster than the sequential method, even for sequences of length 1 million. Further details can be found in our paper.
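As a rough illustration of the idea (a simplified sketch, not the exact algorithm or code from the paper), one can treat all T hidden states as the unknowns of one large nonlinear system, s_t - f(s_{t-1}, x_t) = 0, and apply Newton's method to the whole trajectory at once. Each Newton iteration linearizes the recurrence into an affine one, s_t = J_t s_{t-1} + c_t, which can be solved for every t simultaneously with a parallel associative scan. The function names, the fixed iteration count, and the zero initial guess below are illustrative assumptions.

```python
import jax
import jax.numpy as jnp

def parallel_rnn_newton(inputs, state0, W_h, W_x, b, num_iters=20):
    # Illustrative sketch: refine a guess for the whole trajectory s_1..s_T
    # with Newton iterations; each iteration is computed in parallel over t.
    T, d = inputs.shape[0], state0.shape[0]
    cell = lambda s, x: jnp.tanh(W_h @ s + W_x @ x + b)

    def combine(left, right):
        # Compose two affine maps s -> A s + c (vectorized over the scan axis).
        A1, c1 = left
        A2, c2 = right
        return (jnp.einsum('...ij,...jk->...ik', A2, A1),
                jnp.einsum('...ij,...j->...i', A2, c1) + c2)

    def newton_iteration(states):
        # Previous state for each step: s_0, s_1, ..., s_{T-1}.
        prev = jnp.concatenate([state0[None, :], states[:-1]], axis=0)
        # Evaluate the cell and its Jacobian w.r.t. the previous state
        # at every time step in parallel.
        f_vals = jax.vmap(cell)(prev, inputs)
        jacs = jax.vmap(jax.jacfwd(cell, argnums=0))(prev, inputs)
        # Linearized recurrence: s_t = J_t s_{t-1} + c_t.
        c = f_vals - jnp.einsum('tij,tj->ti', jacs, prev)
        # Fold the known initial state into the first step.
        c = c.at[0].add(jacs[0] @ state0)
        A = jacs.at[0].set(jnp.zeros((d, d)))
        # Solve the affine recurrence for all t at once (log-depth on a GPU).
        _, new_states = jax.lax.associative_scan(combine, (A, c))
        return new_states

    states = jnp.zeros((T, d))  # initial guess for s_1 .. s_T
    for _ in range(num_iters):
        states = newton_iteration(states)
    return states
```

The associative scan is what turns each iteration into a log-depth parallel computation over the sequence length; the per-step Jacobians it requires also hint at where the memory cost of this kind of approach comes from.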

Our method is not without limitations. Currently it is restricted to a small number of hidden dimensions and it consumes a large amount of memory. These limitations prevent its use in some interesting applications and present challenges to be overcome in future work. Despite them, we have shown that it is possible to parallelize the "un-parallelizable" RNN! The research generated a great deal of discussion during our poster presentation at ICLR, underscoring the importance of this work.

Muhammad Kasim, Co-founder + CSO
Muhammad invented the machine learning technology that launched the company, establishing the foundation of all of our work so far. As CSO, he leads the company's machine learning and deep learning R&D, from exploring new applications to developing new architectures and algorithms. Mach42 is the second Oxford spin-out he has co-founded.
