Jennifer Gill
Wed 20 Sep

Skyhawk Security Launches Comprehensive Generative AI Benchmark Ranking LLMs Based on Cyber Threat Scoring Capabilities

Press Releases

Free resource analyzes the performance of ChatGPT, Google BARD, Claude, LLAMA2-based open LLMs.

TEL AVIV, Israel, September 20, 2023 – Skyhawk Security, the originator of cloud threat detection and response, today launched the industry’s first benchmark for evaluating large language models’ (LLMs) ability to identify and score cybersecurity threats within various cloud logs and telemetries. The resource also provides a ranking of these LLMs based on their performance. As part of efforts to strengthen the broader cloud security industry, the data will be regularly updated and available to view free of charge on Skyhawk’s website.

The benchmark and LLM leaderboard will be formally presented today during a session led by Skyhawk’s Director of AI and Research, Amir Shachar, at the Cloud Security Alliance’s SECtember conference. The session takes place at 1:30 p.m. Pacific in room 405.

“The importance of swiftly and effectively detecting cloud security threats cannot be overstated. We firmly believe that harnessing generative AI can greatly benefit security teams in that regard, however, not all large language models are created equal,” said Amir Shachar. “In creating this benchmark, we hope to increase confidence in the power of LLMs for cloud security by providing a clear view of how well these tools can classify malicious activities. We’re testing them for you on human-labeled attack flow sequences based on business-driven evaluation metrics. We also integrate human security researchers’ insights with self-improving LLM-based AI agents to enhance the classification process.”

In this benchmark, Skyhawk looks at ChatGPT, Google Bard, Falcon and other LLAMA2-based open LLMs. The goal was to see how accurately each of these LLMs predicted the maliciousness of an attack sequence that was extracted and created by Skyhawk Security’s machine learning models. The output from the models was compared to a sample of hundreds of human-labeled sequences and scored in three ways: Precision, Recall and F1 Score. The closer to “one” the scores are, the more accurate the predictability of the LLM.

The release of Skyhawk’s LLM benchmark reinforces the company’s dedication to innovating with generative AI in the cloud security space. The news comes on the heels of the launch of Skyhawk’s Shift Left CDR solution within its existing Skyhawk Synthesis Security Platform. The novel approach shifts the threat detection process to the “left,” or the perimeter, of the cloud network as well as IAM. Skyhawk’s cloud threat detection and response uses contextual analysis of the cloud infrastructure and determines potential paths hackers could take to a company’s “crown jewels.” This information enables security teams to identify serious threats much earlier in the incident and prioritize those that pose the highest risk to crown jewels to prevent them from becoming a breach.

To learn more about Skyhawk Security’s product offering, visit https://skyhawk.security/. For continuing updates follow Skyhawk Security on LinkedIn and Twitter.

About Skyhawk Security

Skyhawk Security is the originator of Cloud Threat Detection and Response (CDR), helping hundreds of users map and remediate sophisticated threats to cloud infrastructure in minutes. Led by a team of cybersecurity and cloud professionals who built the original CSPM category, Skyhawk Security evolves cloud security posture management far beyond scanning and static configuration analysis. Instead, using advanced generative AI and ML sequencing of context-based behaviors, Skyhawk provides CDR within a ‘Runtime Hub’ to quickly detect and remediate malicious activities across multiple cloud platforms as they happen. Skyhawk Security is a spin-off of Radware® (NASDAQ:RDWR).

Media Contacts:

Sherlyn Rijos-Altman

Montner Tech PR

srijos@montner.com

Press Release

February 2, 2026

Jennifer Duman of Skyhawk Security Named a 2026 CRN® Channel Chief

Recognition spotlights Duman’s leadership in scaling Skyhawk’s channel-first strategy and accelerating partner momentum globally TEL AVIV, Israel, February 2, 2026 – Skyhawk Security, the leader in AI-based purple team-powered cloud security, today announced that CRN®, a brand of The

Management

Press Release

August 4, 2025

Skyhawk Security and Scytale Partner to Streamline SOC 2 Compliance

Scytale customers gain complimentary access to Skyhawk’s Purple Team Assessment to demonstrate security readiness and adequate organization controls in cloud security environments BLACK HAT CONFERENCE, LAS VEGAS, August 4, 2025 — Skyhawk Security, the leader in Purple Team-Powered CDR, today

Cloud SecurityThreat Detection

Press Release

August 4, 2025

Skyhawk Security Launches Wiz Integration that Slashes CNAPP Alert Noise by 99%, Uncovers True Threats Hidden Within

New Integration helps busy security teams zero in on weaponized threats, reduce alert fatigue, and reduce operational costs for security and application teams BLACK HAT CONFERENCE, LAS VEGAS, August 4, 2025 — Skyhawk Security, the leader in Purple Team-powered Cloud

Cloud SecurityAI

Press Release

April 23, 2025

Skyhawk Expands AI-powered Purple Team to Secure Cloud Applications

Now preemptively identifies vulnerabilities in cloud applications, prioritizes risks and continuously monitors threats across application and infrastructure, all without agents RSA CONFERENCE, SAN FRANCISCO, April 23, 2025 – Skyhawk Security, the originator of Cloud Detection and Response (CDR), announces a

Management

Press Release

December 2, 2024

Skyhawk Introduces Interactive Cloud Threat Detection to Enable Multi-Factor Cloud Native Zero Trust

AWS re:Invent 2024, LAS VEGAS, December 2, 2024 – Skyhawk Security, the originator of cloud threat detection and response (CDR), is adding an Interactive Cloud Threat Detection and Response capability to its groundbreaking platform. The new capability adds real-time user

Management

Press Release

July 30, 2024

Skyhawk Introduces Complimentary Purple Team Assessment to Empower Channel Partners to Identify Cloud Risks

Expands channel program, enables partners to offer powerful Assessments to their clients BLACK HAT CONFERENCE, LAS VEGAS, July 30, 2024 – Skyhawk Security, the originator of Cloud Threat Detection and Response (CDR), revolutionized cloud security when it introduced the industry’s

Cloud SecurityThreat Detection

See the Purple Team

See the breach before it happens

First Name

Last Name

Company

Country *

I agree to Skyhawk privacy policy & terms

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.