Warning: file_put_contents(aCache/aDaily/post/ai_machinelearning_big_data/-7469-7470-7469-): Failed to open stream: No space left on device in /var/www/tg-me/post.php on line 50
Machinelearning | Telegram Webview: ai_machinelearning_big_data/7469 -
Telegram Group & Telegram Channel
πŸ”₯ Π Π΅Π»ΠΈΠ· Qwen 3 ΠΎΡ‚ Alibaba

Π’ Ρ€Π΅Π»ΠΈΠ· вошли 2 MoE-ΠΌΠΎΠ΄Π΅Π»ΠΈ ΠΈ 6 Dense models (ΠΏΠ»ΠΎΡ‚Π½Ρ‹Π΅ ΠΌΠΎΠ΄Π΅Π»ΠΈ), Ρ€Π°Π·ΠΌΠ΅Ρ€ΠΎΠΌ ΠΎΡ‚ 0.6B Π΄ΠΎ 235B ΠΏΠ°Ρ€Π°ΠΌΠ΅Ρ‚Ρ€ΠΎΠ².

πŸ† Ѐлагманская модСль Qwen3-235B-A22B дСмонстрируСт ΠΊΠΎΠ½ΠΊΡƒΡ€Π΅Π½Ρ‚Π½Ρ‹Π΅ Ρ€Π΅Π·ΡƒΠ»ΡŒΡ‚Π°Ρ‚Ρ‹ Π² Π·Π°Π΄Π°Ρ‡Π°Ρ… Кодина, ΠΌΠ°Ρ‚Π΅ΠΌΠ°Ρ‚ΠΈΠΊΠΈ ΠΈ ΠΎΠ±Ρ‰ΠΈΡ… способностСй, ΡƒΠ²Π΅Ρ€Π΅Π½Π½ΠΎ сопСрничая с ΠΏΠ΅Ρ€Π΅Π΄ΠΎΠ²Ρ‹ΠΌΠΈ модСлями, Ρ‚Π°ΠΊΠΈΠΌΠΈ ΠΊΠ°ΠΊ DeepSeek-R1, o1, o3-mini, Grok-3 ΠΈ Gemini-2.5-Pro.
⚑ НСбольшая MoE-модСль Qwen3-30B-A3B прСвосходит QwQ-32B,  ΠΈΡΠΏΠΎΠ»ΡŒΠ·ΡƒΡŽ Π² 10 Ρ€Π°Π· мСньшС ΠΏΠ°Ρ€Π°ΠΌΠ΅Ρ‚Ρ€ΠΎΠ².
πŸ”₯ ΠšΠΎΠΌΠΏΠ°ΠΊΡ‚Π½Π°Ρ модСль Qwen3-4B сопоставима ΠΏΠΎ ΠΏΡ€ΠΎΠΈΠ·Π²ΠΎΠ΄ΠΈΡ‚Π΅Π»ΡŒΠ½ΠΎΡΡ‚ΠΈ с Qwen2.5-72B-Instruct.
🧠 ΠŸΠΎΠ΄Π΄Π΅Ρ€ΠΆΠΈΠ²Π°Π΅Ρ‚ Π³ΠΈΠ±Ρ€ΠΈΠ΄Π½Ρ‹ΠΉ Ρ€Π΅ΠΆΠΈΠΌ ΠΌΡ‹ΡˆΠ»Π΅Π½ΠΈΡ

Π Π΅ΠΆΠΈΠΌ Ρ€Π°Π·ΠΌΡ‹ΡˆΠ»Π΅Π½ΠΈΡ активируСтся ΠΏΡ€ΠΈ ΠΎΠ±Ρ€Π°Π±ΠΎΡ‚ΠΊΠ΅ слоТных Π·Π°Π΄Π°Ρ‡, обСспСчивая ΠΏΠΎΡˆΠ°Π³ΠΎΠ²Ρ‹ΠΉ Π°Π½Π°Π»ΠΈΠ· запроса ΠΈ Ρ„ΠΎΡ€ΠΌΠΈΡ€ΠΎΠ²Π°Π½ΠΈΠ΅ комплСксных, Π³Π»ΡƒΠ±ΠΎΠΊΠΈΡ… ΠΎΡ‚Π²Π΅Ρ‚ΠΎΠ².

Π‘Π°Π·ΠΎΠ²Ρ‹ΠΉ Ρ€Π΅ΠΆΠΈΠΌ ΠΈΡΠΏΠΎΠ»ΡŒΠ·ΡƒΠ΅Ρ‚ΡΡ для повсСднСвных вопросов, позволяя Π²Ρ‹Π΄Π°Π²Π°Ρ‚ΡŒ быстрыС ΠΈ Ρ‚ΠΎΡ‡Π½Ρ‹Π΅ ΠΎΡ‚Π²Π΅Ρ‚Ρ‹ с минимальной Π·Π°Π΄Π΅Ρ€ΠΆΠΊΠΎΠΉ.

ΠŸΡ€ΠΎΡ†Π΅ΡΡ обучСния ΠΌΠΎΠ΄Π΅Π»ΠΈ устроСн ΠΏΠΎΡ…ΠΎΠΆΠΈΠΌ ΠΎΠ±Ρ€Π°Π·ΠΎΠΌ Π½Π° Ρ‚ΠΎ, ΠΊΠ°ΠΊ это сдСлано Π² DeepSeek R1.

ΠŸΠΎΠ΄Π΄Π΅Ρ€ΠΆΠΈΠ²Π°Π΅Ρ‚ 119 языков, Π²ΠΊΠ»ΡŽΡ‡Π°Ρ русский.

Π›ΠΈΡ†Π΅Π½Π·ΠΈΡ€ΠΎΠ²Π°Π½ΠΈΠ΅: Apache 2.0 πŸ”₯

πŸ”œΠŸΠΎΠΏΡ€ΠΎΠ±ΠΎΠ²Π°Ρ‚ΡŒ: https://chat.qwen.ai/
πŸ”œBlog: https://qwenlm.github.io/blog/qwen3/
πŸ”œGitHub: https://github.com/QwenLM/Qwen3
πŸ”œHugging Face: https://huggingface.co/collections/Qwen/qwen3-67dd247413f0e2e4f653967f
πŸ”œ ModelScope: https://modelscope.cn/collections/Qwen3-9743180bdc6b48

@ai_machinelearning_big_data

#Qwen
Please open Telegram to view this post
VIEW IN TELEGRAM
Please open Telegram to view this post
VIEW IN TELEGRAM



tg-me.com/ai_machinelearning_big_data/7469
Create:
Last Update:

πŸ”₯ Π Π΅Π»ΠΈΠ· Qwen 3 ΠΎΡ‚ Alibaba

Π’ Ρ€Π΅Π»ΠΈΠ· вошли 2 MoE-ΠΌΠΎΠ΄Π΅Π»ΠΈ ΠΈ 6 Dense models (ΠΏΠ»ΠΎΡ‚Π½Ρ‹Π΅ ΠΌΠΎΠ΄Π΅Π»ΠΈ), Ρ€Π°Π·ΠΌΠ΅Ρ€ΠΎΠΌ ΠΎΡ‚ 0.6B Π΄ΠΎ 235B ΠΏΠ°Ρ€Π°ΠΌΠ΅Ρ‚Ρ€ΠΎΠ².

πŸ† Ѐлагманская модСль Qwen3-235B-A22B дСмонстрируСт ΠΊΠΎΠ½ΠΊΡƒΡ€Π΅Π½Ρ‚Π½Ρ‹Π΅ Ρ€Π΅Π·ΡƒΠ»ΡŒΡ‚Π°Ρ‚Ρ‹ Π² Π·Π°Π΄Π°Ρ‡Π°Ρ… Кодина, ΠΌΠ°Ρ‚Π΅ΠΌΠ°Ρ‚ΠΈΠΊΠΈ ΠΈ ΠΎΠ±Ρ‰ΠΈΡ… способностСй, ΡƒΠ²Π΅Ρ€Π΅Π½Π½ΠΎ сопСрничая с ΠΏΠ΅Ρ€Π΅Π΄ΠΎΠ²Ρ‹ΠΌΠΈ модСлями, Ρ‚Π°ΠΊΠΈΠΌΠΈ ΠΊΠ°ΠΊ DeepSeek-R1, o1, o3-mini, Grok-3 ΠΈ Gemini-2.5-Pro.
⚑ НСбольшая MoE-модСль Qwen3-30B-A3B прСвосходит QwQ-32B,  ΠΈΡΠΏΠΎΠ»ΡŒΠ·ΡƒΡŽ Π² 10 Ρ€Π°Π· мСньшС ΠΏΠ°Ρ€Π°ΠΌΠ΅Ρ‚Ρ€ΠΎΠ².
πŸ”₯ ΠšΠΎΠΌΠΏΠ°ΠΊΡ‚Π½Π°Ρ модСль Qwen3-4B сопоставима ΠΏΠΎ ΠΏΡ€ΠΎΠΈΠ·Π²ΠΎΠ΄ΠΈΡ‚Π΅Π»ΡŒΠ½ΠΎΡΡ‚ΠΈ с Qwen2.5-72B-Instruct.
🧠 ΠŸΠΎΠ΄Π΄Π΅Ρ€ΠΆΠΈΠ²Π°Π΅Ρ‚ Π³ΠΈΠ±Ρ€ΠΈΠ΄Π½Ρ‹ΠΉ Ρ€Π΅ΠΆΠΈΠΌ ΠΌΡ‹ΡˆΠ»Π΅Π½ΠΈΡ

Π Π΅ΠΆΠΈΠΌ Ρ€Π°Π·ΠΌΡ‹ΡˆΠ»Π΅Π½ΠΈΡ активируСтся ΠΏΡ€ΠΈ ΠΎΠ±Ρ€Π°Π±ΠΎΡ‚ΠΊΠ΅ слоТных Π·Π°Π΄Π°Ρ‡, обСспСчивая ΠΏΠΎΡˆΠ°Π³ΠΎΠ²Ρ‹ΠΉ Π°Π½Π°Π»ΠΈΠ· запроса ΠΈ Ρ„ΠΎΡ€ΠΌΠΈΡ€ΠΎΠ²Π°Π½ΠΈΠ΅ комплСксных, Π³Π»ΡƒΠ±ΠΎΠΊΠΈΡ… ΠΎΡ‚Π²Π΅Ρ‚ΠΎΠ².

Π‘Π°Π·ΠΎΠ²Ρ‹ΠΉ Ρ€Π΅ΠΆΠΈΠΌ ΠΈΡΠΏΠΎΠ»ΡŒΠ·ΡƒΠ΅Ρ‚ΡΡ для повсСднСвных вопросов, позволяя Π²Ρ‹Π΄Π°Π²Π°Ρ‚ΡŒ быстрыС ΠΈ Ρ‚ΠΎΡ‡Π½Ρ‹Π΅ ΠΎΡ‚Π²Π΅Ρ‚Ρ‹ с минимальной Π·Π°Π΄Π΅Ρ€ΠΆΠΊΠΎΠΉ.

ΠŸΡ€ΠΎΡ†Π΅ΡΡ обучСния ΠΌΠΎΠ΄Π΅Π»ΠΈ устроСн ΠΏΠΎΡ…ΠΎΠΆΠΈΠΌ ΠΎΠ±Ρ€Π°Π·ΠΎΠΌ Π½Π° Ρ‚ΠΎ, ΠΊΠ°ΠΊ это сдСлано Π² DeepSeek R1.

ΠŸΠΎΠ΄Π΄Π΅Ρ€ΠΆΠΈΠ²Π°Π΅Ρ‚ 119 языков, Π²ΠΊΠ»ΡŽΡ‡Π°Ρ русский.

Π›ΠΈΡ†Π΅Π½Π·ΠΈΡ€ΠΎΠ²Π°Π½ΠΈΠ΅: Apache 2.0 πŸ”₯

πŸ”œΠŸΠΎΠΏΡ€ΠΎΠ±ΠΎΠ²Π°Ρ‚ΡŒ: https://chat.qwen.ai/
πŸ”œBlog: https://qwenlm.github.io/blog/qwen3/
πŸ”œGitHub: https://github.com/QwenLM/Qwen3
πŸ”œHugging Face: https://huggingface.co/collections/Qwen/qwen3-67dd247413f0e2e4f653967f
πŸ”œ ModelScope: https://modelscope.cn/collections/Qwen3-9743180bdc6b48

@ai_machinelearning_big_data

#Qwen

BY Machinelearning





Share with your friend now:
tg-me.com/ai_machinelearning_big_data/7469

View MORE
Open in Telegram


Machinelearning Telegram | DID YOU KNOW?

Date: |

The lead from Wall Street offers little clarity as the major averages opened lower on Friday and then bounced back and forth across the unchanged line, finally finishing mixed and little changed.The Dow added 33.18 points or 0.10 percent to finish at 34,798.00, while the NASDAQ eased 4.54 points or 0.03 percent to close at 15,047.70 and the S&P 500 rose 6.50 points or 0.15 percent to end at 4,455.48. For the week, the Dow rose 0.6 percent, the NASDAQ added 0.1 percent and the S&P gained 0.5 percent.The lackluster performance on Wall Street came on uncertainty about the outlook for the markets following recent volatility.

The global forecast for the Asian markets is murky following recent volatility, with crude oil prices providing support in what has been an otherwise tough month. The European markets were down and the U.S. bourses were mixed and flat and the Asian markets figure to split the difference.The TSE finished modestly lower on Friday following losses from the financial shares and property stocks.For the day, the index sank 15.09 points or 0.49 percent to finish at 3,061.35 after trading between 3,057.84 and 3,089.78. Volume was 1.39 billion shares worth 1.30 billion Singapore dollars. There were 285 decliners and 184 gainers.

Machinelearning from sg


Telegram Machinelearning
FROM USA