Invest In Crypto News
  • Home
  • Latest News
    • Bitcoin News
    • Altcoin News
    • Ethereum News
    • Blockchain News
    • Doge News
    • NFT News
    • Video
    • Market Analysis
    • Business
    • Finance
    • Politics
    • Mining
    • Regulation
    • Technology
  • Top 10 Cryptos
  • Market Cap List
  • IC DAO
  • Donations
  • Contact
  • Buy Crypto
  • IC DAO
No Result
View All Result
Invest In Crypto News
  • Home
  • Latest News
    • Bitcoin News
    • Altcoin News
    • Ethereum News
    • Blockchain News
    • Doge News
    • NFT News
    • Video
    • Market Analysis
    • Business
    • Finance
    • Politics
    • Mining
    • Regulation
    • Technology
  • Top 10 Cryptos
  • Market Cap List
  • IC DAO
  • Donations
  • Contact
  • Buy Crypto
  • IC DAO
No Result
View All Result
Invest In Crypto News
No Result
View All Result

TEAL Introduces Training-Free Activation Sparsity to Boost LLM Efficiency

CryptoExpert by CryptoExpert
September 1, 2024
in Blockchain News
0
10BedICU Leverages OpenAI's API to Revolutionize Critical Care in India
  • Facebook
  • Twitter
  • Pinterest


You might also like

Paxos Dashboard Enhances Governance, Adds Self-Serve Features

Bitcoin Treasuries Add 603 BTC Amid Strategy’s Pause

Algorand (ALGO)’s xChain Accounts Enable EVM Wallet Use Without New Keys



Zach Anderson
Sep 01, 2024 08:34

TEAL offers a training-free approach to activation sparsity, significantly enhancing the efficiency of large language models (LLMs) with minimal degradation.





TEAL (Training-Free Activation Sparsity in LLMs) has emerged as a groundbreaking approach to improve the efficiency of large language models (LLMs) without requiring additional training. According to together.ai, this method applies magnitude pruning to hidden states throughout the model, achieving 40-50% activation sparsity with minimal degradation. This innovation allows for the transfer of fewer weights to on-chip memory, addressing the memory-bound nature of LLM inference and translating into 1.53-1.8x wall-clock speedups in single-batch decoding.

Background

LLMs are known for their massive size, which poses challenges during inference, primarily due to the speed limitations of transferring parameters from device memory to registers. Various techniques such as quantization, weight sparsity, and speculative decoding have been developed to tackle this ‘memory wall’. Activation sparsity, which leverages zero values in hidden states, is a less explored method that avoids transferring unnecessary weight channels during decoding.

Older models like OPT-175B show high activation sparsity, enabling methods like DejaVu to achieve significant speedups. However, newer models like LLaMA have moved to SwiGLU variants, making it harder to apply such methods. Recent research has attempted to ‘recover’ models that exhibit activation sparsity, but these require extensive retraining on massive datasets.

Motivating Study: Distributional Properties of Activations in LLMs

Research has shown that hidden states in LLMs exhibit outliers and are zero-centered with similar distributional shapes across layers. Specifically, states before MLP and Attention Blocks are Gaussian-shaped, while intermediate states are Laplacian-shaped. This suggests that many low-magnitude activations can be pruned with negligible model degradation, a concept also observed in other studies like CATS.

okex

TEAL

TEAL introduces an optimization by sparsifying every tensor in the model, achieving near-zero degradation at 25% sparsity and minimal degradation at 40% sparsity. At 50% sparsity, Llama-3 variants show slightly more degradation compared to older Llama-2 and Mistral variants. TEAL outperforms CATS by sparsifying every tensor and choosing to sparsify through input, yielding lower error.

Hardware-Aware Speed-up

To benchmark real-world speedups, TEAL was integrated with GPT-Fast, achieving significant speedups of up to 1.53x and 1.8x at 40% and 50% sparsity, respectively. While the kernel is faster than cuBLAS at 0% sparsity, there is still room for further optimization.

Compatibility with Quantization

TEAL also demonstrates compatibility with quantization, another technique for efficient LLM inference. Combining activation sparsity and quantization unlocks new regimes for transferring memory to GPU registers, allowing for higher inference speed-ups.

Applications

TEAL’s most immediate application is accelerating inference in resource-constrained edge settings, particularly in single-batch scenarios. It also aids inference providers like Together AI, which hosts over 100 open-source models across a large fleet of GPUs, by serving models more efficiently.

Image source: Shutterstock



Source link

  • Facebook
  • Twitter
  • Pinterest
CryptoExpert

CryptoExpert

Recommended For You

Paxos Dashboard Enhances Governance, Adds Self-Serve Features

by CryptoExpert
May 27, 2026
0
Paxos Dashboard Enhances Governance, Adds Self-Serve Features

Zach Anderson May 26, 2026 20:42 Paxos updates its enterprise Dashboard with scalable approvals, audit logs, and webhook management to strengthen institutional compliance. ...

Read more

Bitcoin Treasuries Add 603 BTC Amid Strategy’s Pause

by CryptoExpert
May 27, 2026
0
Pyth Network Integrates Price Oracles with IOTA EVM

Luisa Crawford May 26, 2026 12:17 Smaller Bitcoin treasuries acquired 603 BTC worth $46M despite Strategy pausing purchases. Market trends point to mid-$70K support...

Read more

Algorand (ALGO)’s xChain Accounts Enable EVM Wallet Use Without New Keys

by CryptoExpert
May 26, 2026
0
Post-Submission Steps for Algorand (ALGO) Change the Game Hackathon

Ted Hisokawa May 25, 2026 18:21 Algorand (ALGO)'s xChain Accounts leverage Smart Signature tech, letting EVM wallet users transact seamlessly on Algorand. Here's how...

Read more

BNB Chain Launches On-Chain Payments for AI Agents

by CryptoExpert
May 26, 2026
0
BNB Chain Resolves BscScan Lag Issue, opBNB Still Undergoing Fixes

Caroline Bishop May 25, 2026 13:05 BNB Chain’s Agent Survival Pack enables AI agents to pay autonomously using crypto, pushing the agent economy forward...

Read more

HBAR Price Prediction: Dead Cat Bounce or Real Rally? $0.085-$0.095 Range Battle Ahead

by CryptoExpert
May 25, 2026
0
HBAR Price Prediction: $0.065 Bottom Hunt Before Potential 30% Bounce to $0.12

Timothy Morano May 24, 2026 08:37 HBAR's 4.36% pump masks deeper weakness with RSI stuck at 46 and MACD flatlining. Smart money leans bullish...

Read more
Next Post
Solana (SOL) Whale Sees The Future in AI and Gambling As They Invest In FET and Mpeppe (MPEPE)

Solana (SOL) Whale Sees The Future in AI and Gambling As They Invest In FET and Mpeppe (MPEPE)

Leave a Reply Cancel reply

Your email address will not be published. Required fields are marked *

Browse by Category

  • Altcoin News
  • Bitcoin News
  • Blockchain News
  • Business
  • Doge News
  • Ethereum News
  • Finance
  • Market Analysis
  • Mining
  • NFT News
  • Politics
  • Regulation
  • Technology
  • Trending Cryptos
  • Video

Sitemap

  • Market Cap
  • Donations
  • Trading
  • Mining
  • Contact

Legal Information

  • Privacy Policy
  • Anti-Spam Policy
  • Copyright Notice
  • DMCA Compliance
  • Social Media Disclaimer
  • Terms Of Service

Categories

  • Altcoin News
  • Bitcoin News
  • Blockchain News
  • Business
  • Doge News
  • Ethereum News
  • Finance
  • Market Analysis
  • Mining
  • NFT News
  • Politics
  • Regulation
  • Technology
  • Trending Cryptos
  • Video

© Copyright 2024 InvestInCryptoNews.com

No Result
View All Result
  • Home
  • Latest News
    • Bitcoin News
    • Altcoin News
    • Ethereum News
    • Blockchain News
    • Doge News
    • NFT News
    • Video
    • Market Analysis
    • Business
    • Finance
    • Politics
    • Mining
    • Regulation
    • Technology
  • Top 10 Cryptos
  • Market Cap List
  • IC DAO
  • Donations
  • Contact
  • Buy Crypto
  • IC DAO

© Copyright 2024 InvestInCryptoNews.com

This website is using cookies to improve the user-friendliness. You agree by using the website further.

Privacy policy
bitcoin
Bitcoin (BTC) $ 74,236.00
ethereum
Ethereum (ETH) $ 2,019.57
tether
Tether (USDT) $ 0.998539
bnb
BNB (BNB) $ 647.04
xrp
XRP (XRP) $ 1.30
usd-coin
USDC (USDC) $ 0.999758
solana
Solana (SOL) $ 82.24
tron
TRON (TRX) $ 0.367754
figure-heloc
Figure Heloc (FIGR_HELOC) $ 1.03
staked-ether
Lido Staked Ether (STETH) $ 2,265.05

Pin It on Pinterest

Are you sure want to unlock this post?
Unlock left : 0
Are you sure want to cancel subscription?