ReLU Strikes Back: Exploiting Activation Sparsity in Large Language Models (2023)
Subreddit to discuss AI & Llama, the large language model created by Meta AI.
I found a way to speed up CPU based LLM inference using a HNSW index on the output embeddings
Subreddit to discuss AI & Llama, the large language model created by Meta AI.
I made a Three Body Problem Simulator to explore the emergence of complexity from simple physical systems.
r/SideProject is a subreddit for sharing and receiving constructive feedback on side projects.
I built a Three Body Problem Simulator to explore the emergence of complexity from simple physical systems.
Accelerate GPT Output Embedding computations with a Vector Index
Subreddit to discuss AI & Llama, the large language model created by Meta AI.
Accelerate GPT Output Embedding computations with a Vector Index
Subreddit about Artificial Neural Networks, Deep Learning and Machine Learning.