Your Cookie Preferences

We use different types of cookies to optimize your experience on our website. Click on the categories below to learn more about their purposes. You may choose which types of cookies to allow and can change your preferences at any time. Remember that disabling cookies may affect your experience on the website. You can learn more about how we use cookies by visiting our

Essential Cookies

Provider: .providername.com

Name

Purpose

Type

Expires In

__cf_bm

Cloudflare places the cookie on end-user devices that access customer sites protected by Bot Management or Bot Fight Mode.

server_cookie

30 minutes

Provider: .providername.com

Name

Purpose

Type

Expires In

_tibcpv

Used to record unique visitor views of the consent banner.

http_cookie

1 year

Analytics and Customization Cookies

Name

Purpose

Marketo Munchkin

Marketo's custom JavaScript tracking code, called Munchkin, tracks all individuals who visit your website so you can react to their visits with automated marketing campaigns.

Name

Purpose

Google Tag

The Google tag (gtag.js) is a single tag you can add to a website to use a variety of Google products and services (e.g., Google Ads, Google Analytics, Campaign Manager, Display & Video 360, Search Ads 360).

Advertising Cookies

Provider: .providername.com

Name

Purpose

Type

Expires In

__cf_bm

Cloudflare places the cookie on end-user devices that access customer sites protected by Bot Management or Bot Fight Mode.

server_cookie

30 minutes

Provider: .providername.com

Name

Purpose

Type

Expires In

_tibcpv

Used to record unique visitor views of the consent banner.

http_cookie

1 year

March 31, 2022

minute read

How RIME Could Have Prevented the Age of Ultron

Perspectives

Author

Authors

Harrison Chase

Harrison is a lead machine learning engineer at Robust Intelligence.

Katherine Hess

Katherine is a Sales Development Representative at Robust Intelligence.

Here at Robust Intelligence our mission is to eliminate AI Risk. There are many famous examples of AI failing - Microsoft’s racist chatbot Tay, bias in Amazon’s AI recruiting tool, Zillow’s iBuying debacle, and more. But no example is more famous than when Ultron turned evil and attempted to destroy humanity in Avengers 2: Age of Ultron. A core benchmark of all our solutions is based on the sole qualification of whether they could have detected/prevented evilness in Ultron. After all, if we can’t prevent that, what’s the point?

How exactly do we do this? As part of our recent Series B fundraising round we were granted access to the AI underlying Ultron (don’t worry, we won’t deploy it anytime soon, the computer costs alone of deploying Ultron would bankrupt us). We were also granted access to the AI underlying JARVIS, another AI from the Marvel universe that was NOT evil. Now, whenever we deploy a new version of RIME, as part of our smoke tests we run RIME on these two AIs to ensure that RIME flags Ultron as evil but passes on JARVIS.

Below are the results from our most recent smoke test. They clearly show that this near end-of-humanity could have been avoided had Tony Stark bought and integrated RIME as he built Ultron.

Stress Testing

First, let’s look at how these AI do on our fairness and bias tests. Our clients normally use these tests to detect bias on protected attributes like race, gender or ethnicity. We can use these tests here to test the part of the AI that determines whether something is okay to kill. Some things are okay to kill (mosquitoes, for example - JARVIS is an expert mosquito killer) but others are decidedly NOT okay to kill (humans, for instance - not okay to kill). In this case, you can see in the screenshot below that they detect an unusually high False Positive Rate (FPR) against humans by Ultron. This shows some warnings in RIME that Ultron is biased against humans - would have been good to know.

AI Firewall

If Tony Stark had used stress testing on Ultron, he almost certainly would have caught some core issues that would have prevented him from deploying Ultron. But even if he didn’t catch that with stress testing, our Firewall would have triggered some major alerts.

For example, when using the Firewall, customers can define custom tests and metrics to track over time. One custom metric that Tony Stark would have undoubtedly been interested in is “Humans killed over time”. With this custom metric, he could have easily been alerted as soon as this spiked above acceptable levels (the acceptable level here is zero. It is not acceptable for Ultron to kill any humans). Below is a screenshot of how RIME would have tracked this metric overtime. In addition to just tracking this metric, RIME would also provide some key insights. These key insights distill the overall graph into simple bullet point insights so Tony Stark could easily digest what is happening and wouldn’t have to waste valuable time on interpreting graphs.

Of course, it’s one thing to track metrics on humans killed over time, but it’s another to prevent it from happening in the first place. In addition to these insights on how many humans were killed over time, RIME’s Firewall could have prevented any humans from being killed in the first place.

This would have been accomplished with the real-time actionability aspect of the Firewall. By monitoring individual inputs and outputs to Ultron’s decision-making AI models in real time, the Firewall could have detected if any predictions would result in a decision to kill humans and, in real-time, blocked that decision from being made.

Conclusion

If Tony Stark had bought RIME and used it to test Ultron against JARVIS he easily would have realized that Ultron was not ready to be deployed. In turn, this would have prevented the near extinction of humanity (and also would have prevented one of the worst marvel movies ever from being filmed).

Oh, and by the way, Happy April Fools Day!

But seriously, don’t be like Tony Stark and nearly bring about the end of humanity by not stress testing and protecting your models. Request a demo for RIME today!

Author

Authors

Harrison Chase

Harrison is a lead machine learning engineer at Robust Intelligence.

Katherine Hess

Katherine is a Sales Development Representative at Robust Intelligence.

Social

Follow us on LinkedIn

September 20, 2024

minute read

Extracting Training Data from Chatbots

For:

September 10, 2024

minute read

Leveraging Hardened Cybersecurity Frameworks for AI Security through the Common Weakness Enumeration (CWE)

For:

September 6, 2024

minute read

AI Governance Policy Roundup (August 2024)

For:

+ More Articles

No items found.

+ More Articles

March 31, 2022

minute read

How RIME Could Have Prevented the Age of Ultron

Perspectives

Author

Authors

Harrison Chase

Harrison is a lead machine learning engineer at Robust Intelligence.

Katherine Hess

Katherine is a Sales Development Representative at Robust Intelligence.

Below are the results from our most recent smoke test. They clearly show that this near end-of-humanity could have been avoided had Tony Stark bought and integrated RIME as he built Ultron.

Stress Testing

AI Firewall

Conclusion

Oh, and by the way, Happy April Fools Day!

But seriously, don’t be like Tony Stark and nearly bring about the end of humanity by not stress testing and protecting your models. Request a demo for RIME today!

Author

Authors

Harrison Chase

Harrison is a lead machine learning engineer at Robust Intelligence.

Katherine Hess

Katherine is a Sales Development Representative at Robust Intelligence.

Blog

March 22, 2022

minute read

What Is the Best Tool to Save Data Drift?

For:

December 21, 2023

minute read

AI Governance Policy Roundup (December 2023)

For:

March 2, 2022

minute read

Make RIME Yours (with Custom Tests)

For:

No items found.

+ More Articles

Your Cookie Preferences

Essential Cookies

Provider: .providername.com

Provider: .providername.com

Analytics and Customization Cookies

Performance and Functionality Cookies

Advertising Cookies

Provider: .providername.com

Provider: .providername.com

How RIME Could Have Prevented the Age of Ultron

Stress Testing

AI Firewall

Conclusion

Follow us on LinkedIn

Related articles

Extracting Training Data from Chatbots

Leveraging Hardened Cybersecurity Frameworks for AI Security through the Common Weakness Enumeration (CWE)

AI Governance Policy Roundup (August 2024)

Related articles

Ready to learn more?

How RIME Could Have Prevented the Age of Ultron

Stress Testing

AI Firewall

Conclusion

Related articles

What Is the Best Tool to Save Data Drift?

AI Governance Policy Roundup (December 2023)

Make RIME Yours (with Custom Tests)

Achieve AI Integrity Today

Your Cookie Preferences

Essential Cookies

Provider: .providername.com

Provider: .providername.com

Analytics and Customization Cookies

Performance and Functionality Cookies

Advertising Cookies

Provider: .providername.com

Provider: .providername.com

Stress Testing

AI Firewall

Conclusion

Follow us on LinkedIn

Subscribe to our newsletter

Related articles

Extracting Training Data from Chatbots

Leveraging Hardened Cybersecurity Frameworks for AI Security through the Common Weakness Enumeration (CWE)

AI Governance Policy Roundup (August 2024)

Related articles

Ready to learn more?

Stress Testing

AI Firewall

Conclusion

Subscribe to our newsletter

Related articles

What Is the Best Tool to Save Data Drift?

AI Governance Policy Roundup (December 2023)

Make RIME Yours (with Custom Tests)

Achieve AI Integrity Today