Hashtag
The Times

The change marks a transition from so-called "vibe coding" to what researchers increasingly describe as agentic engineering.

LLM Performance Evaluation: Agentic, Reasoning and Coding

Built for this new phase, GLM-5 ranks among the strongest open-source models for coding and autonomous task execution. In practical programming settings, its performance approaches that of Claude Opus 4.5, particularly in complex system design and long-horizon tasks requiring sustained planning and execution.

The model rests on a new architecture aimed at scaling both capability and efficiency. Its parameter count has expanded from 355bn to 744bn, with active parameters rising from 32bn to 40bn, while pre-training data has grown to 28.5trn tokens. These increases are paired with advances in training methods. A framework called Slime enables asynchronous reinforcement learning at a larger scale, allowing the model to learn continuously from extended interactions and improve post-training efficiency. GLM-5 also introduces DeepSeek Sparse Attention, which maintains long-context performance while cutting deployment costs and improving token efficiency.
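To put the scaling figures in proportion, the short sketch below works out what share of the total parameters is active in each case and how much each figure has grown. It uses only the numbers quoted above; the function name `active_fraction` is illustrative and not part of any GLM tooling.

```python
def active_fraction(active_bn: float, total_bn: float) -> float:
    """Share of total parameters that are active, as a fraction between 0 and 1."""
    return active_bn / total_bn


# Figures quoted in the announcement, in billions of parameters.
PREV_TOTAL_BN, PREV_ACTIVE_BN = 355, 32
NEW_TOTAL_BN, NEW_ACTIVE_BN = 744, 40

prev_share = active_fraction(PREV_ACTIVE_BN, PREV_TOTAL_BN)
new_share = active_fraction(NEW_ACTIVE_BN, NEW_TOTAL_BN)
total_growth = NEW_TOTAL_BN / PREV_TOTAL_BN
active_growth = NEW_ACTIVE_BN / PREV_ACTIVE_BN

print(f"Active share before: {prev_share:.1%}")
print(f"Active share now:    {new_share:.1%}")
print(f"Total parameters grew {total_growth:.2f}x; active parameters grew {active_growth:.2f}x")
```

On these figures, total parameters roughly doubled while active parameters rose by a quarter, so the share of the model that is active fell from about 9% to about 5.4%.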

Benchmarks suggest strong gains. On SWE-bench-Verified and Terminal Bench 2.0, GLM-5 scores 77.8 and 56.2, respectively, the highest reported results for open-source models, surpassing Gemini 3 Pro in several software-engineering tasks. On Vending Bench 2, which simulates running a vending-machine business over a year, it finishes with a balance of $4,432, leading other open-source models in operational and economic management.

These results highlight the qualities required for agentic engineering: maintaining goals across long horizons, managing resources, and coordinating multi-step processes. As models increasingly assume these capabilities, the frontier of AI appears to be shifting from writing code to delivering functioning systems.

Chat & Official API Access

Z.ai Chat: https://chat.z.ai
GLM Coding Plan: https://z.ai/subscribe?utm_source=pr&utm_medium=press&utm_campaign=launch

Open-Source Repositories

GitHub: https://github.com/zai-org/GLM-5
Hugging Face: https://huggingface.co/zai-org/GLM-5

Blog

GLM-5 Technical Blog: https://z.ai/blog/glm-5
Hashtag: #ZAI

The issuer is solely responsible for the content of this announcement.
