You can now convert any LLM into a faster one without retraining from scratch. NVIDIA just did this to their 30B model. Here's the trick:

1. Duplicate the model into two copies.
2. Freeze one copy; it just reads the prompt and remembers context.
3. Train the other copy to write chunks of text at once instead of one word at a time.
4. Run them together.

The frozen copy barely costs anything (it's already trained). The new copy only needed ~8% of the original training data to learn the new trick.

Result: 2.4x fa
0:13
103.4K views · 1 day ago
x.com · Lior Alexander
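
The four steps in the post can be illustrated with a toy sketch. This is not NVIDIA's implementation: the "model" here is just a dictionary of numbers, and every name (`duplicate_model`, `apply_update`, `generation_steps`) is hypothetical. The chunk size of 4 in the usage note is an assumption chosen for illustration; the post does not state one, and the step count below is not meant to reproduce the reported speedup.

```python
import copy
import math


def duplicate_model(weights):
    """Steps 1-2: make two independent copies, one frozen and one trainable."""
    reader = {"weights": copy.deepcopy(weights), "trainable": False}
    writer = {"weights": copy.deepcopy(weights), "trainable": True}
    return reader, writer


def apply_update(model, grads, lr=0.1):
    """Step 3: a plain gradient step that skips frozen copies entirely."""
    if not model["trainable"]:
        return
    for name, grad in grads.items():
        model["weights"][name] -= lr * grad


def generation_steps(num_tokens, chunk_size):
    """Count generation steps when each step emits chunk_size tokens."""
    if num_tokens < 0 or chunk_size < 1:
        raise ValueError("num_tokens must be >= 0 and chunk_size >= 1")
    return math.ceil(num_tokens / chunk_size)
```

For example, after `reader, writer = duplicate_model({"w": 1.0})`, calling `apply_update` on both copies with the same gradient changes only `writer`. With an assumed chunk size of 4, `generation_steps(256, 4)` needs 64 steps where one-token-at-a-time generation, `generation_steps(256, 1)`, needs 256.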