Grok 3, developed by xAI, is described as a significant leap forward from Grok 2, with claims of being "an order of magnitude more capable" and trained with 10 times the computational power. While precise side-by-side examples from official sources are limited due to Grok 3's recent release (February 17, 2025), insights from xAI's announcements, expert commentary, and user sentiment on X paint a picture of its enhanced intelligence. Below, I'll outline how Grok 3 is smarter than Grok 2 and provide illustrative examples, some based on reported capabilities and others inferred from its stated advancements.
### How Grok 3 is Smarter Than Grok 2

1. **Enhanced Reasoning and Problem-Solving** Grok 3 introduces advanced reasoning capabilities, including a "Think" mode that breaks down problems step-by-step, reducing errors and improving logical consistency. It's designed to tackle complex tasks in math, science, and coding more effectively than Grok 2. The increased compute (200,000 GPUs vs. Grok 2's roughly 20,000) allows it to process larger datasets and refine its logic.
2. **Improved Contextual Understanding** With a 128K-token context window inherited from Grok 2, Grok 3 uses it 20% more efficiently due to better memory management. This means it can maintain coherence over longer conversations and pull in more relevant details from prior interactions or real-time X data.
3. **Multimodal Capabilities** Unlike Grok 2, which was primarily text-focused with basic image processing, Grok 3 handles text, images, audio, and potentially video. This broadens its ability to interpret and respond to diverse inputs.
4. **Faster Learning and Adaptability** Grok 3’s training on synthetic datasets and self-correction mechanisms allows it to learn from mistakes in real time, making it more adaptable than Grok 2, which relied on static training improvements.
5. **Performance on Benchmarks** xAI claims Grok 3 outperforms Grok 2 across various benchmarks (e.g., math, science, coding), with some experts likening its reasoning to OpenAI’s o1-pro model, a significant jump from Grok 2’s capabilities, which were closer to GPT-4 levels.
### Examples for Grok 2

- **Math Problem (Middle-School Level)**
  *Prompt:* "If a train leaves Station A at 9 AM traveling 60 mph and another leaves Station B at 10 AM traveling 80 mph, when do they meet if the stations are 200 miles apart?"
  *Grok 2 Response:* It might calculate the relative speed (140 mph when moving toward each other) and determine the distance covered after the second train starts (200 miles - 60 miles = 140 miles left). Time = 140 / 140 = 1 hour, so they meet at 11 AM. However, Grok 2 could stumble on edge cases or fail to clarify assumptions (e.g., direction), occasionally giving a less precise answer like "around 11 AM."
- **Coding Task (Simple Script)**
  *Prompt:* "Write a Python script to reverse a string."
  *Grok 2 Response:*
  ```python
  def reverse_string(s):
      return s[::-1]

  print(reverse_string("hello"))  # Outputs: "olleh"
  ```
  Grok 2 handles this well but might not explain the slicing method or catch edge cases (e.g., empty strings) without prompting.
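As a sketch of the edge-case handling mentioned above, a slightly more defensive version might look like the following (this is my own illustration, not output from either model):

```python
def reverse_string(s: str) -> str:
    """Reverse a string; slicing with step -1 walks the string backwards."""
    if not isinstance(s, str):
        raise TypeError("expected a string")
    return s[::-1]

print(reverse_string("hello"))  # "olleh"
print(reverse_string(""))       # "" -- the empty string is a safe no-op
```

The type check and the empty-string case are exactly the kinds of details a stronger model would be expected to surface unprompted.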
- **General Knowledge Query** *Prompt:* "What’s the tallest mountain in the world?" *Grok 2 Response:* "Mount Everest is the tallest mountain, standing at 29,029 feet above sea level." It’s accurate but might not add context like Mauna Kea’s height from base to peak unless asked.
### Examples for Grok 3

- **Math Problem (Advanced Reasoning)**
  *Prompt:* Same as above.
  *Grok 3 Response:* "Let's break this down. Train A starts at 9 AM at 60 mph, so by 10 AM, it's covered 60 miles, leaving 140 miles between them. Train B starts at 10 AM at 80 mph. Assuming they're moving toward each other, their relative speed is 60 + 80 = 140 mph. Time to close 140 miles = 140 / 140 = 1 hour. Thus, they meet at 11 AM. If they're not moving toward each other, please clarify!"
  *Why Smarter:* Grok 3 shows its reasoning step-by-step and proactively addresses ambiguities, unlike Grok 2's potentially simpler output.
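The arithmetic in both answers is easy to verify with a few lines of Python (the variable names here are my own, not from either model):

```python
# Train A departs 9 AM at 60 mph; Train B departs 10 AM at 80 mph.
# Stations are 200 miles apart and the trains head toward each other.
head_start = 60 * 1           # miles Train A covers alone between 9 and 10 AM
remaining = 200 - head_start  # 140 miles left once both trains are moving
closing_speed = 60 + 80       # 140 mph combined closing speed

hours_after_10 = remaining / closing_speed  # 1.0 hour

print(f"They meet {hours_after_10:g} hour(s) after 10 AM, i.e. at 11 AM.")
```

Both models land on the right answer; the difference the examples highlight is in how explicitly the steps and assumptions are laid out.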
- **Coding Task (Complex Debugging)**
  *Prompt:* "Debug this C code snippet with unused variables: `int x = 5; int y = 10; int z; printf("%d", x);`."
  *Grok 3 Response:* "Here's the analysis: `x` is initialized and used in `printf`, which outputs 5. `y` is initialized but unused, and `z` is declared but neither initialized nor used—compilers might warn about both. Cleaned-up version:
  ```c
  int x = 5;
  printf("%d", x);
  ```
  This removes `y` and `z` since they're irrelevant."
  *Why Smarter:* Users on X noted Grok 3 catching unused variables in early builds, suggesting it's more perceptive than Grok 2, which might just format the code without such insight.
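The unused-variable analysis credited to Grok 3 can also be done mechanically. As a toy illustration (my own sketch, unrelated to xAI's internals), here is a minimal checker for Python source using the standard `ast` module, flagging names that are assigned but never read:

```python
import ast

def unused_names(source: str) -> set[str]:
    """Return names that are assigned in the source but never read."""
    tree = ast.parse(source)
    assigned, used = set(), set()
    for node in ast.walk(tree):
        if isinstance(node, ast.Name):
            if isinstance(node.ctx, ast.Store):
                assigned.add(node.id)   # name appears on the left of an assignment
            else:
                used.add(node.id)       # name is loaded (read) somewhere
    return assigned - used

# Python analogue of the C prompt above: y is assigned but never read.
snippet = "x = 5\ny = 10\nprint(x)"
print(unused_names(snippet))  # {'y'}
```

This is essentially what compiler warnings (and, per the reports above, Grok 3's analysis) surface: a set-difference between names written and names read.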
- **Multimodal Query** *Prompt:* "Describe this image: [a photo of a grey cat sleeping on a cushion under a desk]." *Grok 3 Response:* "The image shows a grey Persian cat curled up asleep on a round cushion beneath a light oak desk. The scene is cozy, with natural light streaming in from large windows on the left, filtered by sheer white curtains." *Why Smarter:* Grok 2 might describe basic elements ("a cat on a cushion"), but Grok 3’s multimodal prowess adds detail and context, leveraging its image-understanding upgrade.
### Key Takeaways

Grok 3's improvements stem from its massive compute boost, refined reasoning engine, and expanded capabilities. While Grok 2 was competent, matching or exceeding GPT-4 in some areas, Grok 3 aims for the frontier, competing with models like OpenAI's o1. Examples show Grok 3 providing clearer, more detailed, and adaptive responses, making it noticeably smarter in practical use. However, since Grok 3 is in beta, real-world performance may still evolve as xAI refines it daily.