11-02-2025, 12:11 PM
11-02-2025, 12:38 PM
Yahoo and scmp so quick and aso turn around to sing the praises of china? 太现实了吧
二毛们怎么办是好?
二毛们怎么办是好?
11-02-2025, 01:19 PM
二毛自己吃自己
11-02-2025, 01:35 PM
It seems by removjng ghe middle CUDA layer they gajn the efficiency and make the Huawei chip perform even better than
NVIDIA.
NVIDIA.
11-02-2025, 01:48 PM
India so many programmers how cum they aren't able to do that? Looks like they are not too smart
11-02-2025, 01:48 PM
They will need lots more chips as 910 only can delivers-60-percent-nvidia-h100-inference-performance.
https://www.tomshardware.com/tech-indust...erformance
https://www.tomshardware.com/tech-indust...erformance
11-02-2025, 01:50 PM
(11-02-2025, 01:48 PM)WhatDoYouThink! Wrote: [ -> ]India so many programmers how cum they aren't able to do that? Looks like they are not too smart
Their programmers graduated from dubious Uni and taught by dubious half headed Profs, who were similarly taught
11-02-2025, 01:52 PM
(11-02-2025, 01:35 PM)sgbuffett Wrote: [ -> ]It seems by removjng ghe middle CUDA layer they gajn the efficiency and make the Huawei chip perform even better than
NVIDIA.
Nvda wanna lock users to their CUDA app layer. Deepseek did a machine level programming bypass and increase efficiency.
But that level of expertise is not easy. Doubt SG have it.
Look like the algorithm is the key and hardware in certain extent but not main key.
11-02-2025, 01:55 PM
Dyn worry too much. Ds and hw will catch up very fast
11-02-2025, 01:59 PM
(11-02-2025, 01:52 PM)Niubee Wrote: [ -> ]Nvda wanna lock users to their CUDA app layer. Deepseek did a machine level programming bypass and increase efficiency.
But that level of expertise is not easy. Doubt SG have it.
Look like the algorithm is the key and hardware in certain extent but not main key.
I have tried embedded assembly code to speed up access.
Doable.
11-02-2025, 02:07 PM
(11-02-2025, 01:52 PM)Niubee Wrote: [ -> ]Nvda wanna lock users to their CUDA app layer. Deepseek did a machine level programming bypass and increase efficiency.
But that level of expertise is not easy. Doubt SG have it.
Look like the algorithm is the key and hardware in certain extent but not main key.
Sg employs a lot of cheap CECA!
How to have it?
My son also lost his job recently because company outsource to CECA!

11-02-2025, 02:07 PM
(11-02-2025, 01:59 PM)sgbuffett Wrote: [ -> ]I have tried embedded assembly code to speed up access.
Doable.
Deepseek uses assembly-like PTX programming
11-02-2025, 02:13 PM
(11-02-2025, 01:48 PM)teaserteam Wrote: [ -> ]They will need lots more chips as 910 only can delivers-60-percent-nvidia-h100-inference-performance.
https://www.tomshardware.com/tech-indust...erformance
Google search gave me this :
When comparing the Huawei Ascend 910C to Nvidia's AI chips, the Ascend 910C is positioned as a strong competitor, particularly in the Chinese market, claiming to offer comparable or even superior performance in certain AI tasks, though experts generally acknowledge that Nvidia still holds the overall market lead in terms of technology and wider adoption;
.
.