Perplexity shows how to run monster AI models more efficiently on aging GPUs, AWS networks

News

Some clever networking hacks open the door
AI search provider Perplexity’s research wing has developed a new set of software optimizations that allow trillion-parameter or larger models to run efficiently on older, cheaper hardware using a variety of existing network technologies, including Amazon’s proprietary Elastic Fabric Adapter.…

The Register