No Result
View All Result
World Press Time
  • Home
  • United States
  • UK
  • World
    • Canada
    • Europe
    • Australia
    • Asia
    • South America
    • Africa
  • Politics
  • Business
    • Economy
    • Finance
    • Investing
    • Markets
    • Companies
    • Crypto
  • Lifestyle
  • Entertainment
  • Health
  • Technology
  • Science
  • Sports
  • Travel
  • Contact
Subscribe
  • Login
  • Home
  • United States
  • UK
  • World
    • Canada
    • Europe
    • Australia
    • Asia
    • South America
    • Africa
  • Politics
  • Business
    • Economy
    • Finance
    • Investing
    • Markets
    • Companies
    • Crypto
  • Lifestyle
  • Entertainment
  • Health
  • Technology
  • Science
  • Sports
  • Travel
  • Contact
No Result
View All Result
World Press Time
No Result
View All Result
  • United States
  • UK
  • World
  • Politics
  • Business
  • Lifestyle
  • Entertainment
  • Health
  • Technology
  • Science
  • Sports
  • Travel
  • Videos
Home Business Companies

Microsoft to rank ‘safety’ of AI models sold to cloud customers

Press Room by Press Room
6 months ago
in Companies
Reading Time: 3 mins read
123 4
A A
0
34
SHARES
489
VIEWS
Share on FacebookShare on Twitter

Unlock the Editor’s Digest for free

Roula Khalaf, Editor of the FT, selects her favourite stories in this weekly newsletter.

Microsoft will start ranking artificial intelligence models based on their safety performance, as the software group seeks to build trust with cloud customers as it sells them AI offerings from the likes of OpenAI and Elon Musk’s xAI.

Sarah Bird, Microsoft’s head of Responsible AI, said the company would soon add a “safety” category to its “model leaderboard”, a feature it launched for developers this month to rank iterations from a range of providers including China’s DeepSeek and France’s Mistral.

The leaderboard, which is accessible by tens of thousands of clients using the Azure Foundry developer platform, is expected to influence which AI models and applications are purchased through Microsoft.

Microsoft currently ranks three metrics: quality, cost and throughput, which is how quickly a model can generate an output. Bird told the Financial Times that the new safety ranking would ensure “people can just directly shop and understand” AI models’ capabilities as they decide which to purchase.

The decision to include safety benchmarks comes as Microsoft’s customers grapple with the potential risks posed by new AI models to data and privacy protections, particularly when deployed as autonomous “agents” that can work without human supervision.

Microsoft’s new safety metric will be based on its own ToxiGen benchmark, which measures implicit hate speech, and the Center for AI Safety’s Weapons of Mass Destruction Proxy benchmark. The latter assesses whether a model can be used for malicious purposes such as building a biochemical weapon.

Rankings enable users to have access to objective metrics when selecting from a catalogue of more than 1,900 AI models, so that they can make an informed choice of which to use.

“Safety leader boards can help businesses cut through the noise and narrow down options,” said Cassie Kozyrkov, a consultant and former chief decision scientist at Google. “The real challenge is understanding the trade-offs: higher performance at what cost? Lower cost at what risk?”

Alongside Amazon and Google, the Seattle-based group is considered one of the largest “hyperscalers” that together dominate the cloud market.

Microsoft is also positioning itself as an agnostic platform for generative AI, signing deals to sell models by xAI and Anthropic, rivals to start-up OpenAI which it has backed with roughly $14bn in investment.

Last month, Microsoft said it would begin offering xAI’s Grok family of models under the same commercial terms as OpenAI.

The move came despite a version of Grok raising alarm when an “unauthorised modification” of its code led to it repeatedly referencing “white genocide” in South Africa when responding to queries on social media site X. xAI said it introduced a new monitoring policy to avoid future incidents.

“The models come in a platform, there is a degree of internal review, and then it’s up to the customer to use benchmarks to figure it out,” Bird said.  

There is no global standard for AI safety testing, but the EU’s AI Act will enter force later this year and compel companies to conduct safety tests.

Some model builders including OpenAI are dedicating less time and money to identify and mitigate risks, the FT previously reported citing several people familiar with the start-up’s safety processes. The start-up said it had identified efficiencies without compromising safety.

Bird declined to comment on OpenAI’s safety testing, but said it was impossible to ship a high quality model without investing a “huge amount” in evaluation and that processes were being automated.

Microsoft in April also launched an “AI read teaming agent” that automates the process of stress testing computer programmes by launching attacks to identify vulnerabilities. “You just specify the risk, you specify the attack difficulty . . . And then it’s off attacking your system,” Bird said.

There are concerns that without adequate supervision AI agents could take unauthorised actions opening the owners up to liabilities.

“The risk is that leader boards can lull decision makers into a false sense of security,” said Kozyrkov. “Safety metrics are a starting point, not a green light.”

Read the full article here

Share14Tweet9Share2Pin3SendShareShareShareShare
ADVERTISEMENT

Related Articles

Companies

Abu Dhabi’s Adnoc in deal talks over oil refinery at centre of US sanctions

Companies

Tech elites are starting their own for-profit cities

Companies

Climate rift opens between Amazon and rivals in row over data centre power

Companies

How science can phase out animal testing

Companies

India’s airports in chaos as largest airline cancels hundreds of flights

Companies

Santander executive accused of fraud being probed by Brazil’s central bank

Companies

FCA to grant provisional licences to financial start-ups to help them launch faster

Companies

NHS under pressure to reduce number of private sector operations

Companies

UK banks turn to AI for fraud prevention and to improve services

Load More

Recommended

Luigi Mangione defense team moves to block key backpack evidence

President Trump defers to Secretary Hegseth on boat-strike video release

‘Don’t take him literally,’ Moe says on Trump fertilizer tariff threats

THE MOST IMPORTANT FINANCE NEWS AND EVENTS OF THE DAY

Subscribe to our mailing list to receives daily updates direct to your inbox!

Trending Now

  • On this day in history, Nov. 24, 1874, the first commercially successful barbed wire is patented

    35 shares
    Share 14 Tweet 9
  • New Jersey quintuplets graduate from same university together: ‘Gigantic moment’

    37 shares
    Share 14 Tweet 9
  • Jalen Brunson accepts blame for Knicks’ slow start in loss: ‘A lot of that’s on me’

    34 shares
    Share 14 Tweet 9
  • House GOP leaders demand accountability on Trump assassination attempt: ‘So many questions’

    34 shares
    Share 14 Tweet 9
  • ‘Truly sorry’: Plibersek on the Juukan Gorge Indigenous heritage incident

    34 shares
    Share 14 Tweet 9

About Us

World Press Time

World Press Time is your one-stop news portal, follow us to get the latest politic, business, sports, entertainment any more. follow us now.

Topics

! Без рубрики 9720_sat 9950_prod 10000_prod 10000_sat 10200_prod adobe generative ai 3 Africa Asia Australia blog Bookkeeping Canada casino Companies Crypto Economy Entertainment Europe Finance Forex Trading games guide Health info Investing Lifestyle Markets news omegle Omegle cc Politics Post Science Sober living South America Sports Technology Travel Uncategorized United Kingdom United States updates Videos World

Get informed

THE MOST IMPORTANT FINANCE NEWS AND EVENTS OF THE DAY

Subscribe to our mailing list to receives daily updates direct to your inbox!

  • Privacy Policy
  • Terms of use
  • Press Release
  • Advertise
  • Contact

© 2022 World Press Time - All Rights Reserved.

No Result
View All Result
  • Home
  • United States
  • UK
  • World
    • Canada
    • Europe
    • Australia
    • Asia
    • South America
    • Africa
  • Politics
  • Business
    • Economy
    • Finance
    • Investing
    • Markets
    • Companies
    • Crypto
  • Lifestyle
  • Entertainment
  • Health
  • Technology
  • Science
  • Sports
  • Travel
  • Contact

© 2022 World Press Time - All Rights Reserved.

Welcome Back!

Login to your account below

Forgotten Password?

Retrieve your password

Please enter your username or email address to reset your password.

Log In
This website uses cookies. By continuing to use this website, you are giving consent to cookies being used. Visit our Privacy and Cookie Policy.