Teaching AI to Read and Group Like I Bookmark the Web: A Journey into Dynamic Topic Modeling

Quick Links: Dataset on HuggingFace. The Topic Modeling Challenge: You know that feeling when you have 50 browser tabs open, and you’re desperately trying to organize them into bookmark folders? “ML Papers To Read,” “Funny Cat Videos,” “Recipes I’ll Never Make”… We all have our system. And apparently, it’s such a universal problem that every tech company is launching its own solution: Arc Browser with its “Spaces,” Chrome with its tab groups, and about 500 extensions promising to color-code your digital hoarding habits into submission....

November 11, 2024 | 4 min | Aman Priyanshu

Contra-Topic-bottleneck-t5: Efficient Topic Extraction Without the Computational Overhead

Quick Links: Model on HuggingFace | Interactive Demo. When it comes to topic extraction, the AI world seems fixated on massive models and expensive compute. But what if there was a simpler way? 🤔 The Genesis: Simplicity Through Linear Transformation. Picture this: there I was, looking for an open-source solution to extract topics from text at scale. The available options were either massive language models or complex fine-tuning pipelines. That’s when it hit me – what if we could leverage the semantic structure of existing embeddings with just a linear transformation?...

November 6, 2024 | 3 min | Aman Priyanshu

AdaptKeyBERT: Stumbling Through Two Years of Keyword Extraction

Quick links (in case you want to skip my ramblings): PyPI Package | GitHub Repository. Alright, gather ’round, word enthusiasts and syntax sorcerers! 🧙‍♂️📚 Remember that time you tried to explain machine learning to your grandma and ended up comparing neural networks to her knitting patterns? Well, buckle up, because we’re about to dive into a similar realm of “What was I thinking?” – the saga of AdaptKeyBERT. It’s been two trips around the sun since I cobbled together this quirky little keyword extractor and sent it off into the wild world of NLP....

September 22, 2024 | 3 min | Aman Priyanshu

API-LLM-Hub: Simplifying LLM-API Integration for Static Pages

Hey there, fellow code enthusiasts and AI wranglers! 🖐️🤖 You know that feeling when you’re knee-deep in a project, trying to get multiple AI models to play nice in your browser? Yeah, I’ve been there. Cue the frustrated sighs, the endless searches through GitHub issues 😢, and the “why-isn’t-this-working” hair-pulling sessions. (LINK-TO-PACKAGE, just in case you wanna skip the deets.) After one too many nights wrestling with backends, CORS issues, and the general chaos of integrating various AI APIs, I decided enough was enough....

September 18, 2024 | 4 min | Aman Priyanshu

FRACTURED-SORRY-Bench: Unraveling AI Safety through Decomposing Malicious Intents

Hello, fellow AI enthusiasts! 🤖 Today, I wanted to dive into the FRACTURED-SORRY-Bench framework and dataset we just released. Check out the dataset, website, and GitHub repository! The FRACTURED-SORRY Saga: A Tale of Adaptation and Decomposition. Picture this: you’re wandering through the lush collection of prompt-injection and LLM red-teaming papers, marveling at some of the weirder and crazier attack mechanisms that have been released recently. Suddenly, you realize that there aren’t many proof-of-concept resources for multi-shot red-teaming....

August 28, 2024 | 3 min | Aman Priyanshu