Welcome Guest ( Log In | Register )

Outline · [ Standard ] · Linear+

 Full DeepSeek R1 At Home 🥳🥳🥳

views
     
Penamer
post Jan 28 2025, 05:27 PM

New Member
*
Junior Member
12 posts

Joined: Aug 2022
Just wait for the China version for AI clustering. Sure cheap like cabbage prices.
Penamer
post Jan 28 2025, 05:44 PM

New Member
*
Junior Member
12 posts

Joined: Aug 2022
QUOTE(kingkingyyk @ Jan 28 2025, 02:32 PM)
https://github.com/deepseek-ai/DeepSeek-R1/...s/benchmark.jpg
Here is the comparison. The full 671b models need over 400GB of memory to run, which is out of reach for most people.

Distillation transfers the knowledge to smaller model (i.e. feeding the QA chain), but smaller model has way fewer parameters so they won't be generating result so well.

The list of distilled models:
- DeepSeek-R1-Distill-Qwen-1.5B
- DeepSeek-R1-Distill-Qwen-7B
- DeepSeek-R1-Distill-Llama-8B
- DeepSeek-R1-Distill-Qwen-14B
- DeepSeek-R1-Distill-Qwen-32B
- DeepSeek-R1-Distill-Llama-70B

32B can let you run on RTX5090 nicely with context (the QA chain is involved to generate further response), but how many of us can justify buying that 5 digits GPU + heavy TNB bill just to run this?
*
No wonder China having unemployment issues, all taken over by AI already, they just never announce publicly.
Penamer
post Jan 28 2025, 08:51 PM

New Member
*
Junior Member
12 posts

Joined: Aug 2022
Wow. The 春晚 got robot group dance. Amazing.
Penamer
post Jan 29 2025, 09:35 AM

New Member
*
Junior Member
12 posts

Joined: Aug 2022
QUOTE(syyang85 @ Jan 28 2025, 11:10 PM)
hey. any freelancer willing to do a bit coding for hire?

im planning to build a chatbot using deepseek for my online futsal court booking platform. API ready.
deepseek api cost is very attractive compare to chatgpt.

im a bit lazy to code on the side now.

different topic:
US tech stock dipped after deepseek blew up. but I think its a knee jerk reaction.

deepseek is open sourced. chatgpt and the likes can make reference and make changes to their codes. and with their massive AI chip that is only available to them.
its gonna come back MASSIVE.

its good for business and consumers. I think massive discount gonna come soon to chatgpt and the likes when are on par in term of cost with deepseek.

I think have a long position on US AI stocks is still a good look
thoughts?
*
Ask deepseek to code?

Maybe wait a couple months will have deepseek r2 since liangwenfeng just met with Chinese Premier and gotten Hangzhou govt support after rocking US's AI industry? Imagine having entire huawei cloud for his team to train the next version of deepseek.

Penamer
post Jan 29 2025, 10:14 AM

New Member
*
Junior Member
12 posts

Joined: Aug 2022
Haven't gotten over Deepseek's R1? Try Deepseek's AI image generator, Janus Pro 7B



This post has been edited by Penamer: Jan 29 2025, 10:14 AM

 

Change to:
| Lo-Fi Version
0.0142sec    1.22    6 queries    GZIP Disabled
Time is now: 18th December 2025 - 12:02 PM