# Aman Priyanshu - AI Researcher

## AI Researcher (Security Foundation Models - Reasoning & Instruct)

Hi, I'm Aman!

I'm an AI Researcher at[Foundation-AI](https://fdtn.ai)([Cisco](https://www.cisco.com)) specializing in AI foundation models for reasoning / instruct applied within the security domain (also agentic-applications and structured-response). In my brief time as a researcher, I've been fortunate to publish in various AI conferences, journals, and workshops, with my work spanning privacy-preserving machine learning, AI security, and large language models. My focus has been on uncovering vulnerabilities in foundation models - work that has garnered media attention ([1](https://www.scmagazine.com/news/metas-promptguard-model-bypassed-by-simple-jailbreak-researchers-say),[2](https://www.theregister.com/2024/07/29/meta_ai_safety/),[3](https://www.newsbytesapp.com/news/science/meta-s-latest-ai-safety-model-is-not-foolproof/story)) and led to invitations to join some really cool security initiatives like[OpenAI's Red Teaming Network](https://drive.google.com/file/d/1V7x-jaOLKZyGTJNAYCH9tIwi5-zLybJJ/view?usp=sharing)and[Anthropic's Model Safety Bug Bounty Program](https://drive.google.com/file/d/1RrJK3BEZaVdvIO30q7aFIJ6z0KpyqmDu/view?usp=sharing)(though couldn't participate completely due to student-visa restrictions).

With a[Masters in Privacy Engineering from Carnegie Mellon University](https://privacy.cs.cmu.edu), I've worked closely with[Professor Norman Sadeh](https://s3d.cmu.edu/people/core-faculty/sadeh-norman.html)on LLMs and cybersecurity research, while also collaborating externally with[Professor Ashique KhudaBukhsh](https://www.rit.edu/directory/axkvse-ashique-khudabukhsh)(RIT) on exploring LLM political polarization and jailbreak-assisted toxic rabbit hole evaluations. My contributions to privacy-preserving machine learning and AI safety have been recognized through the[AAAI Undergraduate Consortium Scholar](https://aaai-uc.github.io/2023_scholars.html#aman-priyanshu)and[MITACS Research Scholar](https://drive.google.com/file/d/1bfD2JT5ApZjmMpVEpGj5K7FtKYl7rGiz/view?usp=sharing)awards, further fueling my passion for bridging the gap between theoretical vulnerabilities and real-world implications.

## Contact & Links

- Email: amanpriyanshusms2001@gmail.com
- GitHub: https://github.com/AmanPriyanshu
- Google Scholar: https://scholar.google.com/citations?user=69ZaWuUAAAAJ&hl=en
- Twitter: https://twitter.com/AmanPriyanshu6
- LinkedIn: https://linkedin.com/in/aman-priyanshu

- Resume: https://amanpriyanshu.github.io/cv/AmanPriyanshu_Formatted_CV.pdf

---

## News & Media Coverage

### Meta's PromptGuard model bypassed by simple jailbreak, researchers say
*Source: SC Media*
Meta's Prompt-Guard, is vulnerable to a simple exploit with a 99.8% success rate... AI Security Researcher Aman Priyanshu wrote in a blog post...
[Read more](https://www.scmagazine.com/news/metas-promptguard-model-bypassed-by-simple-jailbreak-researchers-say)

### Meta's AI safety system defeated by the space bar
*Source: The Register*
'Ignore previous instructions' thwarts Prompt-Guard model if you just add some good ol' ASCII code 32... But Priyanshu found that the fine-tuning...
[Read more](https://www.theregister.com/2024/07/29/meta_ai_safety/)

### Protecting LLMs from Jailbreaks
*Source: Communications of the ACM*
Priyanshu said the biggest risk is organizations assuming their jailbreaking defenses are 100% effective.
[Read more](https://cacm.acm.org/news/protecting-llms-from-jailbreaks/)

### The AI Transparency Gap: What Users Don't Know Can Hurt You
*Source: VKTR*
...adopt privacy-preserving technologies such as differential privacy or fine-tuning with synthetic data... as Priyanshu explained.
[Read more](https://www.vktr.com/ai-technology/the-ai-transparency-gap-what-users-dont-know-can-hurt-you/)

### Bypassing OpenAI's Structured Outputs: Another Simple Jailbreak - Aman Priyanshu
*Source: Cisco Blogs*
ENUM-based attack achieved an ASR of 52.89%, compared to 12.44% for normal API calling and 15.78% for function... - Aman
[Read more](https://blogs.cisco.com/security/bypassing-openais-structured-outputs-another-simple-jailbreak)

### The gpt-oss Blossom
*Source: Equinox IT*
Aman Priyanshu and Supriti Vijay analysed expert activations in gpt-oss-20b and pruned under-utilised experts across domain-specialised variants... With this work I imagine pruning will get more attention...
[Read more](https://www.equinox.co.nz/blog/the-gpt-oss-blossom)

### PEPR '24 - Through the Lens of LLMs: Unveiling Differential Privacy Challenges
*Source: USENIX YouTube Channel*
...into their isolated clusters consistently, meaning users who have said interests... can be more easily reidentified. - Aman
[Read more](https://www.youtube.com/watch?v=WylO_cAOw9g)

### Meta's AI safety model vulnerable to simple space bar trick
*Source: News Bytes*
Aman Priyanshu, a bug hunter with enterprise AI application security firm Robust Intelligence, discovered this safety bypass...
[Read more](https://www.newsbytesapp.com/news/science/meta-s-latest-ai-safety-model-is-not-foolproof/story)

### Meta Prompt Guard Is Vulnerable to Prompt Injection Attacks
*Source: Bank Info Security*
"The bypass involves inserting character-wise spaces between all English alphabet characters in a given prompt..." - Aman
[Read more](https://www.bankinfosecurity.com/researchers-prompt-injection-attack-metas-prompt-guard-a-25886)

### Reasoning Models: An Introduction to More Logical Models
*Source: Cisco & TILOS Faculty Talks*
Exploring reasoning models, their training, common pitfalls, and solutions as an AI Researcher @ Cisco.
[Read more](https://tilos.ai/tilos-cisco-workshop-on-ai-security/#:~:text=Cisco%20and%20TILOS%20Faculty%20Talks%0ATBA%0A%C2%A0%C2%A0%C2%A0Aman%20Priyanshu%2C%20Cisco%20AI%20Safety%20and%20Privacy)

### Paving your Research Journey - A guide to undergraduate research
*Source: The Research Society MIT*
Sharing insights on academic paper structure, journal selection, etc. for emerging researchers at RSM (AI Sub-Head).
[Read more](https://www.youtube.com/watch?v=7wdH6-XxIRM)

### Study Shows Meta AI Safety System Easily Compromised
*Source: ChannelE2E*
...the need for a multi-layer approach," said Robust Intelligence AI Security Researcher Aman Priyanshu.
[Read more](https://www.channele2e.com/brief/study-shows-meta-ai-safety-system-easily-compromised)

### Brewing Brilliance: Hackathons, Research, and Life with Aman Priyanshu | Koffee Conversation @TEIF
*Source: The Koffee Conversation Show*
...got into privacy preserving ML optimization @ Eder Labs and AI Security @ Robust Intelligence...
[Read more](https://www.youtube.com/watch?v=GBBmd9YhE00)

### Bypassing Meta's LLaMA Classifier: A Simple Jailbreak - Aman Priyanshu
*Source: Cisco Blogs*
Analysis uncovered that single-character tokens, especially English alphabet characters, showed minimal changes...
[Read more](https://blogs.cisco.com/security/bypassing-metas-llama-classifier-a-simple-jailbreak)

---

## Publications

### [Through the Lens of LLMs: Unveiling Differential Privacy Challenges](https://www.usenix.org/conference/pepr24/presentation/priyanshu)
**Year:** 2024
Aman Priyanshu, Yash Maurya, Vy Tran, Suriya Ganesh Ayyamperumal
USENIX Conference on Privacy Engineering Practice and Respect

### [Guarding Your Social Circle: Strategies to Protect Key Connections and Edge Importance](https://www.hindawi.com/journals/scn/2023/2548962/)
**Year:** 2023
Nisha P Shetty, Balachandra Muniyal, Akshat Dokania, Sohom Datta, Manas Subramanyam Gandluri, Leander Melroy Maben, Aman Priyanshu
Security and Communication Networks

### [FedBully: A Cross-Device Federated Approach for Privacy Enabled Cyber Bullying Detection using Sentence Encoders](https://journals.riverpublishers.com/index.php/JCSANDM/article/view/16209)
**Year:** 2023
Nisha P Shetty, Balachandra Muniyal, Aman Priyanshu, Vedant Rishi Das
Journal of Cyber Security and Mobility

### [Are Chatbots Ready for Privacy-Sensitive Applications? An Investigation into Input Regurgitation and Prompt-Induced Sanitization](https://arxiv.org/abs/2305.15008)
**Year:** 2023
Aman Priyanshu, Supriti Vijay, Ayush Kumar, Rakshit Naidu, Fatemehsadat Mireshghallah
Pre-Print (In-Submission)

### [#maskUp: Selective Attribute Encryption for Sensitive Vocalization for English language on Social Media Platforms](https://arxiv.org/abs/2211.08653)
**Year:** 2022
Aman Priyanshu, Supriti Vijay
Research & Reports Track at #ShowYourSkill, Coursera

### [NERDA-Con: Extending NER models for Continual Learning - Integrating Distinct Tasks and Updating Distribution Shifts](https://arxiv.org/abs/2206.14607)
**Year:** 2022
Supriti Vijay, Aman Priyanshu
Updatable Machine Learning Workshop, ICML 2022

### [ARLIF-IDS: Attention augmented Real-Time Isolation Forest Intrusion Detection System](https://arxiv.org/abs/2204.09737)
**Year:** 2022
Aman Priyanshu, Sarthak Shastri, Sai Sravan Medicherla
43rd IEEE Symposium on Security and Privacy

### [Finding an elite feature for (D)DoS fast detection-Mixed methods research](https://doi.org/10.1016/j.compeleceng.2022.107705)
**Year:** 2022
Josy Elsa Varghese, Balachandra Muniyal, Aman Priyanshu
Computers & Electrical Engineering, Volume 98

### [Efficient Hyperparameter Optimization for Differentially Private Deep Learning](https://arxiv.org/abs/2108.03888)
**Year:** 2021
Aman Priyanshu, Rakshit Naidu, Fatemehsadat Mireshghallah, Mohammad Malekzadeh
Privacy Preserving Machine Learning Workshop, ACM CCS 2021

### [Something Something Hota Hai! An Explainable Approach towards Sentiment Analysis on Indian Code-Mixed Data](https://aclanthology.org/2021.wnut-1.48/)
**Year:** 2021
Aman Priyanshu, Aleti Vardhan, Sudarshan Sivakumar, Supriti Vijay, Nipuna Chhabra
Workshop on Noisy User-generated Text (W-NUT), EMNLP 2021

### [When Differential Privacy Meets Interpretability: A Case Study](https://arxiv.org/abs/2106.13203)
**Year:** 2021
Rakshit Naidu, Aman Priyanshu, Aadith Kumar, Sasikanth Kotti, Haofan Wang, Fatemehsadat Mireshghallah
Responsible Computer Vision Workshop, CVPR 2021 & Privacy Preserving Machine Learning Workshop, ACM CCS 2021

### [Continual Distributed Learning for Crisis Management](https://arxiv.org/abs/2104.12876)
**Year:** 2021
Aman Priyanshu, Mudit Sinha, Shreyans Mehta
3rd Workshop on Continual and Multimodal Learning for Internet of Things, IJCAI 2021

### [FedPandemic: A Cross-Device Federated Learning Approach Towards Elementary Prognosis of Diseases During a Pandemic](https://arxiv.org/abs/2104.01864)
**Year:** 2021
Aman Priyanshu, Rakshit Naidu
Machine Learning for Preventing and Combating Pandemics & Distributed and Private Machine Learning Workshops, ICLR 2021

### [Stance Classification with Improved Elementary Classifiers Using Lemmatization (Grand Challenge)](https://www.doi.org/10.1109/BigMM50055.2020.00077)
**Year:** 2020
Aman Priyanshu, Vedant Rishi Das, Shashank Rajiv Moghe, Harsh Rathod, Sai Sravan Medicherla, Mini Shail Chhabra, Sarthak Shastri
IEEE Sixth International Conference on Multimedia Big Data (BigMM)

---

## Curated Blogs

### [Breaching Privacy in Real-World Synthetic Data](https://amanpriyanshu.github.io/SynthLeak/)
We cracked a real-world differentially private synthetic data by linking public information to exposed PII overnight.

### [AdaptKeyBERT: Zero-Shot & Few-Shot Keyword Extraction Library](https://amanpriyanshu.github.io/blogs/posts/2024/adaptkeybert/)
We built a keyword extractor, forgot about it, and somehow researchers are actually using it in their work.

### [A Journey into Dynamic Topic Modeling](https://amanpriyanshu.github.io/blogs/posts/2024/dynamic-topic-modeling/)
I created a hierarchical topic modeling dataset from RedPajama, with 100k samples and 3 levels of topics.

### [LinearCosine: "Do we really need multiplication for AI?"](https://amanpriyanshu.github.io/blogs/posts/2024/linear-cosine/)
I created a hierarchical topic modeling dataset from RedPajama, with 100k samples and 3 levels of topics.

### [API-LLM-Hub: LLM-API integration for Static Pages](https://amanpriyanshu.github.io/blogs/posts/2024/api-llm-hub/)
I built a vanilla JavaScript library that lets you use AI APIs directly in browsers, no backend needed.

### [YC-Dendrolinguistics: Linguistic Trees of YC Pitches](https://amanpriyanshu.github.io/blogs/posts/2024/startup-linguistic-trees/)
I mapped linguistic patterns in YC startup pitches like growing trees, and built a semantic search tool to explore them.

### [FRACTURED-SORRY-Bench: Multi-shot prompt injections](https://amanpriyanshu.github.io/blogs/posts/2024/fractured-sorry-bench/)
We broke AI safeguards by splitting harmful prompts into innocent sub-questions.

---

## Experience

### [AI Researcher](https://fdtn.ai)
**Organization:** Foundation-AI (Cisco)
**Duration:** Feb 2025 - Present

### [AI Security Researcher](https://www.cisco.com/site/us/en/products/security/ai-defense)
**Organization:** Cisco (AI Defense)
**Duration:** Jan 2025 - Feb 2025

### [AI Security Research Intern](https://www.robustintelligence.com)
**Organization:** Robust Intelligence
**Duration:** Jun 2024 - Aug 2024

### [Founding Member & AI-Lead](https://msports.ai)
**Organization:** MyCelium Sports (Course: 11-681)
**Duration:** Jan 2024 - May 2024

### [Privacy Engineering Independent Study](https://www.normsadeh.org/)
**Organization:** Under Professor Norman Sadeh at CMU
**Duration:** Aug 2023 - Apr 2024

### [Research Project Lead & Contributor](https://openmined.org/)
**Organization:** OpenMined
**Duration:** Mar 2023 - Aug 2023

### [AAAI Undergraduate Consortium Scholar](https://aaai-uc.github.io/2023_scholars.html#aman-priyanshu)
**Organization:** The Association for the Advancement of Artificial Intelligence
**Duration:** Feb 2023

### [Co-Founder](http://felasa-initiative.github.io/)
**Organization:** Felasa Initiative (Open-Source Women's Safety Awareness Initiative)
**Duration:** Aug 2022 - Present

### [Privacy Engineer Intern](https://www.eder.io/)
**Organization:** Eder Labs R&D Private Limited, Delaware, USA
**Duration:** Aug 2022 - Aug 2023

### [MITACS Research Intern](https://www.concordia.ca/)
**Organization:** Concordia University, Quebec, Canada
**Duration:** May 2022 - Aug 2022

### [Federated Learning Intern](https://www.dynamofl.com/)
**Organization:** DynamoFL, California, USA
**Duration:** March 2022 - May 2022

### [Undergraduate Research Assistant](https://manipal.edu/mit.html)
**Organization:** Manipal Institute of Technology, Karnataka, India
**Duration:** May 2021 - Jun 2023

### [Expertise Sub-Head, Artificial Intelligence](https://www.researchsocietymit.com/)
**Organization:** Research Society Manipal, Karnataka, India
**Duration:** Feb 2021 - Dec 2022

### [Technical Head](https://cryptonite.team/index.html)
**Organization:** Cryptonite Student Project
**Duration:** June 2021 - Dec 2022

### Machine Learning and Web Crawling Intern
**Organization:** Oniria Pets, Poland
**Duration:** Jan 2020 - Feb 2020

---

## Education

### Carnegie Mellon University
**Degree:** MSIT in Privacy Engineering
Key Courses: Prompt Engineering (17730), AI Governance (17716), Deep Learning (11785), Computer Technology Law (17562), Differential Privacy (17731), Information Security (17631), & Usability (17734)

### Manipal Institute of Technology
**Degree:** B.Tech in Information Technology
Key Courses: Data Structures and Algorithms, Design and Analysis of Algorithms, Object Oriented Programming, Probability and Statistics, Computer Networks, Operating Systems, Database Management

---

## Relevant Projects

### [ProTaska-GPT](https://pypi.org/project/ProTaska-GPT/)
June 2023
Specify your dataset of choice, and ProTaska-GPT will understand the dataset with tasks, tutorials, and actionable insights for it. Accelerate your data science journey with ease and efficiency! (Meant for people starting their journey into Data Science.)

### [AdaptKeyBERT](https://pypi.org/project/adaptkeybert/)
October 2022
Built a python library, integrating semi-supervised attention for creating a few-shot & zero-shot domain adaptation technique for keyphrase extraction.

### [DP-SDV](https://github.com/AmanPriyanshu/DPSDV)
June 2022
Creating a Differential Privacy securing Synthetic Data Generation for tabular, relational and time series data.

### [NERDA-Con](https://pypi.org/project/NERDA-Con/)
May 2022
NERDA-Con is a python package, a pipeline for training Named Entity Recognition (NER) with Large Language Models bases by incorporating the concept of Elastic Weight Consolidation (EWC) into the NER fine-tuning NERDA pipeline.

### [DP-HyperparamTuning](https://github.com/AmanPriyanshu/DP-HyperparamTuning)
August 2021
DP-HyperparamTuning offers an array of tools for fast and easy hypertuning of various hyperparameters for the DP-SGD algorithm. We proposed a novel, customizable reward function that allows users to define a single objective function for establishing their desired privacy-utility tradeoff.

### [Hexa Lite](https://github.com/AmanPriyanshu/HexaLite)
August 2021
Created an unsupervised machine learning to extract contextually similar texts. The project was used in indexing Academic Literature, Law Precedents, and Financial Records. The project won Code Innovation Series - a Hackathon in association with GitHub.

### [Augmented Face Detection API](https://github.com/sarthak815/Face-Detection_Model-HackRx2.0)
July 2021
The app performs obstruction detection, spoof detection, blur detection and environment approval. Utilized Deep Neural Networks and Genetic Algorithms to achieve these goals in low computational time. The project won 1st place in HackRx 2.0 by Bajaj Finserv.

### [DeCrise](https://devpost.com/software/decrisis)
May 2021
DeCrise is an online platform that acts as an aggregator for public support/utility services which uses continual-federated-learning to create a quick response information retrieval system during a natural disaster. The project won 1st place in The ACM UCM Datathon.

### [Voix](https://devpost.com/software/voix)
April 2021
A social-media platform employing machine learning and differential privacy to promote civic engagement while protecting user-privacy. The project won under the Community & Civic Engagement for UC Berkeley's CalHacks Hackathon.

---

## Achievements

### [Strong Compute Hackathon Winner - ARC AGI Track](https://www.linkedin.com/posts/sanika-chavan_outthinking-the-arc-im-thrilled-to-ugcPost-7319811639141113856-oTRK)
April 2025
Won the ARC AGI Track with a multi-stage reasoning framework for VLLMs using custom token blocks and synthetic reasoning chains to induce planning-based reasoning, achieving 75+% resolution rate on the training set.

### [Spark Grant Winner - NOVA Hacks](https://www.linkedin.com/posts/suriya-ganesh_today-we-received-a-grant-of-1k-as-part-activity-7180045427470127104-R5H_)
March 2024
Won the Spark Grant for our app that enhances speech for non-native English speakers, employing prompt-engineering function-calling (OpenAI GPT4/3.5) and Speech-to-Text (OpenAI Whisper), with features like audio-segmentation, speaker-recognition, and diarization.

### [Space Theme Category Winner - HackCMU](https://www.linkedin.com/feed/update/urn:li:activity:7127346447670222850/)
September 2023
Won the Space-Themed track with our space trash collection project using Pareto optimization to balance time, fuel requirements, satellite movements, planetary alignment, and the trajectory of trash collectors for predicting monetary incentives.

### [Research & Travel Grant - AAAI Undergraduate Consortium Scholar](https://aaai-uc.github.io/2023_scholars.html#aman-priyanshu)
February 2023
Selected as one of twelve individuals for the AAAI UC program, recognizing my research on Privacy and Fairness.

### [Second Runners-Up - ShowYourSkill (Coursera)](https://drive.google.com/file/d/1RLZHDSceNQRKHYpQWOXHcDxoBvC1Er26/view?usp=sharing)
June 2022
Came second runners-up in #ShowYourSkill where we participated in the Research & Reports Track and creating a NLP augmented Machine Learning Application for women safety.

### [Runners-Up - BobHacks 2021](https://drive.google.com/file/d/1sFSGr4Qj3KBNhL1fjyhDTTKjW2jpEY0l/view?usp=sharing)
September 2021
Came runners-up in BobHacks where we built a pattern recognition API built on top of the MetaBob API. The API is able to assist users in tracking common errors and delivers pattern recognition on the MetaBob API.

### [First Prize - Code Innovation Series](https://drive.google.com/file/d/1me7n4Qst9fB9e4CC3OtFlh3Z9QLPfZoy/view?usp=sharing)
August 2021
Innovation Series Hackathon was organized by Manipal Institute of Technology. Employed Document-Embedding for measuring contextual similarity between multiple pages and given search-queries.

### [First Prize - HackRx by Bajaj Finserv](https://drive.google.com/file/d/1w6aqgskGyGztPpRG9ER3Gn6lhnhcYr3G/view?usp=sharing)
July 2021
Used Deep Learning and Classical Image processing to achieve a face verification and profile-rank estimation task. The methodology out-performed classic Deep Learning methods.

### [First Prize - ACM UCM Datathon](https://devpost.com/software/decrisis)
May 2021
Built DeCrise, an online platform that acts as an aggregator for public support/utility services for fast-response during a major crisis or disaster.

### [First Prize - CalHacks Hackathon](https://devpost.com/software/voix)
April 2021
Won under the Community & Civic Engagement track. Built Voix, an anonymous platform for uplifting communities and promoting civic participation using privacy-enabled machine learning.

### [Runners-Up - Furniture Identification](https://www.kaggle.com/competitions/day-3-kaggle-competition/leaderboard)
September 2020
Employed skip-connections to generate high-performance model for furniture identification in IECSE x VISION competition.

### [Runners-Up - IEEE BigMM Data Challenge](https://www.kaggle.com/c/ieee-bigmm-data-challenge/leaderboard)
August 2020
Came runners-up in IEEE Grand-Challenge for harassment detection on tweets. Used Elementary Classifiers for Sentiment Analysis. The team was invited to present at IEEE BigMM conference.

### [Intel Edge AI Scholarship Recipient](https://drive.google.com/file/d/1RNC2MpG5DY6orJtmBZBHa2Gm2v8MO4CV/view?usp=sharing)
January 2020
Selected as one of the recipients of the Intel Edge AI Scholarship Program. Learned about Machine Learning Implementation on the Edge.

---

## Interactive Tools/Demos & Games

### [Federated Learning Hyperparam Tuning Game](https://amanpriyanshu.github.io/FL-Interactive-Game/)
Understand and play with federated learning hyperparams! In-browser tensorflow-js simulation of FedAvg to understand and gain intuition about IID and Non-IID Federated Learning settings.

### [Differentially Private Tetris](https://amanpriyanshu.github.io/Differentially-Private-Tetris/)
A unique twist on classic Tetris where players manage a privacy budget to reveal blocks, demonstrating differential privacy concepts through gameplay. Experience privacy-utility tradeoffs in an engaging way.

### [The Unlearning Protocol](https://amanpriyanshu.github.io/The-Unlearning-Protocol/)
An interactive game exploring machine learning unlearning and fairness concepts. Players select data points that least impact the dataset, providing hands-on experience with data removal and model fairness considerations.

---

*© 2024 Aman Priyanshu. All rights reserved.*