Generative AI features under the Apple Intelligence banner have steered clear from leveraging NVIDIA GPUs to handle cloud-based inputs, with the California-based giant sticking with its custom silicon in its servers that will eventually be replaced by the unreleased M4 Ultra to speed up its Large Language Models. However, a recent blog post from the iPhone maker reveals that Apple and its engineers are not shying away from partnering with NVIDIA if it means both entities have a common goal; implementing faster text generation performance with LLMs. A new ‘Recurrent Drafter’ technique has been published and open-sourced by Apple, and […]
Read full article at https://wccftech.com/apple-and-nvidia-researching-on-redrafter-technique-to-increase-llm-performance/