Posts: 1,678
   
Threads: 176
    
Likes Received: 307 in 266 posts
Likes Given: 87
"Senator, I am Singaporean."
🤣🤣🤣
Posts: 6,594
   
Threads: 344
    
Likes Received: 1,278 in 1,098 posts
Likes Given: 1,572
So it is Nvidia selling from Singapore 🤣
Posts: 13,454
   
Threads: 589
    
Likes Received: 2,873 in 2,530 posts
Likes Given: 50
Typical of a loser mentality … lose … point finger at everyone, blame everyone instead of looking at the mirror
KTV妹妹说,香港人无义,台湾人无情,新加坂人无智
>
Posts: 1,678
   
Threads: 176
    
Likes Received: 307 in 266 posts
Likes Given: 87
(01-02-2025, 09:42 PM)moonrab Wrote: So it is Nvidia selling from Singapore 🤣
You can Google for it. Nvidia Asia Hq in Suntec city
Posts: 23,647
   
Threads: 51
    
Likes Received: 4,425 in 3,985 posts
Likes Given: 1,153
For small gujin guraj companies it's easy to handle. But nvidia woh, how ah?
Posts: 14,546
   
Threads: 112
    
Likes Received: 968 in 1,003 posts
Likes Given: 2,197
Posts: 15,712
   
Threads: 683
    
Likes Received: 3,077 in 2,738 posts
Likes Given: 980
The one used is A100 GPU. Very old ones. Even if u use the newest one and u won't get the maximum benefits because all usage of Nvidia GPU goes thru the CUDA application layer.
DeepSeek bypass the CUDA layer and wrote their own algorithm.

That one one way to 弯道超车
Posts: 23,647
   
Threads: 51
    
Likes Received: 4,425 in 3,985 posts
Likes Given: 1,153
Bypassing cuda and rewrite the code in ptx (or mix with c/c++) can give much better math calculations and hardware specific SIMD operations. And to do that within such a short time frame is simply amazing
Posts: 6,399
   
Threads: 142
    
Likes Received: 938 in 871 posts
Likes Given: 296
(02-02-2025, 01:00 PM)WhatDoYouThink! Wrote: Bypassing cuda and rewrite the code in ptx (or mix with c/c++) can give much better math calculations and hardware specific SIMD operations. And to do that within such a short time frame is simply amazing
tomshardware.com › tech-industry › artificial-intelligence › deepseeks-ai-breakthrough-bypasses-industry-standard-cuda-uses-assembly-like-ptx-programming-instead
DeepSeek's AI breakthrough bypasses industry-standard CUDA for some ...
5 days ago PTX sits between higher-level GPU programming languages (like CUDA C/C++ or other language frontends) and the low-level machine code (streaming assembly, or SASS).
Posts: 23,647
   
Threads: 51
    
Likes Received: 4,425 in 3,985 posts
Likes Given: 1,153
I think they wrote the whole program in cuda, and only need to rewrite the compute-intensive, iterative training loops in ptx which is only a small part.
Users browsing this thread: 1 Guest(s)