Topic: AI Security & Safety

AI Security

MarinDecember 6, 2022

Introduction to AI-Enabled Disinformation

In recent years, the rise of artificial intelligence (AI) has revolutionized many sectors, bringing about significant advancements in various fields. However, one area where AI has presented a dual-edged sword is in information operations, specifically in the propagation of disinformation. The advent of generative AI, particularly with sophisticated models capable of creating highly realistic text, images, audio, and video, has exponentially increased the risk of…
Read More »
AI Security

MarinDecember 1, 2022

The Unseen Dangers of GAN Poisoning in AI

GAN Poisoning is a unique form of adversarial attack aimed at manipulating Generative Adversarial Networks (GANs) during their training phase; unlike traditional cybersecurity threats like data poisoning or adversarial input attacks, which either corrupt training data or trick already-trained models, GAN Poisoning focuses on altering the GAN's generative capability to produce deceptive or harmful outputs. The objective is not merely unauthorized access but the generation…
Read More »
AI Security

MarinNovember 24, 2022

“Magical” Emergent Behaviours in AI: A Security Perspective

Emergent behaviours in AI have left both researchers and practitioners scratching their heads. These are the unexpected quirks and functionalities that pop up in complex AI systems, not because they were explicitly trained to exhibit them, but due to the intricate interplay of the system's complexity, the sheer volume of data it sifts through, and its interactions with other systems or variables. It's like giving…
Read More »
AI Security

MarinNovember 11, 2022

How Dynamic Data Masking Reinforces Machine Learning Security

Data masking, also known as data obfuscation or data anonymization, serves as a crucial technique for ensuring data confidentiality and integrity, particularly in non-production environments like development, testing, and analytics. It operates by replacing actual sensitive data with a sanitized version, rendering the data ineffective for malicious exploitation while retaining its functional utility for testing or analysis.
Read More »
AI Security

MarinNovember 10, 2022

How Label-Flipping Attacks Mislead AI Systems

Label-flipping attacks refer to a class of adversarial attacks that specifically target the labeled data used to train supervised machine learning models. In a typical label-flipping attack, the attacker changes the labels associated with the training data points, essentially turning "cats" into "dogs" or benign network packets into malicious ones, thereby aiming to train the model on incorrect or misleading associations. Unlike traditional adversarial attacks…
Read More »
AI Security

MarinOctober 11, 2022

Backdoor Attacks in Machine Learning Models

Backdoor attacks in the context of Machine Learning (ML) refer to the deliberate manipulation of a model's training data or its algorithmic logic to implant a hidden vulnerability, often referred to as a "trigger." Unlike typical vulnerabilities that are discovered post-deployment, backdoor attacks are often premeditated and planted during the model's development phase. Once deployed, the compromised ML model appears to function normally for standard…
Read More »
AI Security

MarinAugust 24, 2022

Perturbation Attacks in Text Classification Models

Text Classification Models are critical in a number of cybersecurity controls, particularly in mitigating risks associated with phishing emails and spam. However, the emergence of sophisticated perturbation attacks poses substantial threats, manipulating models into erroneous classifications and exposing inherent vulnerabilities. The explored mitigation strategies, including advanced detection techniques and defensive measures like adversarial training and input sanitization, are instrumental in defending against these attacks, preserving…
Read More »
AI Security

MarinAugust 8, 2022

How Multimodal Attacks Exploit Models Trained on Multiple Data Types

In simplest terms, a multimodal model is a type of machine learning algorithm designed to process more than one type of data, be it text, images, audio, or even video. Traditional models often specialize in one form of data; for example, text models focus solely on textual information, while image recognition models zero in on visual data. In contrast, a multimodal model combines these specializations,…
Read More »

Trending AI Security & AI Safety Posts

All AI Security & AI Safety Posts

Introduction to AI-Enabled Disinformation

The Unseen Dangers of GAN Poisoning in AI

How Dynamic Data Masking Reinforces Machine Learning Security

How Label-Flipping Attacks Mislead AI Systems

Backdoor Attacks in Machine Learning Models

Perturbation Attacks in Text Classification Models

How Multimodal Attacks Exploit Models Trained on Multiple Data Types