Announcement_11.3.2025
Our ASPLOS’25 paper PAPI: Exploiting Dynamic Parallelism in Large Language Model Decoding with a Processing-In-Memory-Enabled Computing System is online.
Our ASPLOS’25 paper PAPI: Exploiting Dynamic Parallelism in Large Language Model Decoding with a Processing-In-Memory-Enabled Computing System is online.