All HF Hub posts

MikeDoes 
posted an update 2 days ago
Anonymizing a prompt is half the battle. Reliably de-anonymizing the response is the other.

To build a truly reliable privacy pipeline, you have to test it. A new Master's thesis does just that, and our data was there for every step.

We're excited to showcase this work on handling confidential data in LLM prompts from Nedim Karavdic at Mälardalen University. To build their PII anonymization pipeline, they first trained a custom NER model. We're proud that the Ai4Privacy pii-masking-200k dataset was used as the foundational training data for this critical first step.

But it didn't stop there. The research also used our dataset to create the parallel data needed to train and test the generative "Seek" models for de-anonymization. It's a win-win when our open-source data not only helps build the proposed "better solution" but also helps prove why it's better by enabling a rigorous, data-driven comparison.
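The anonymize-then-restore round trip described above can be sketched in a few lines. This is a minimal illustration under stated assumptions, not the thesis pipeline: the regex patterns stand in for the trained NER model, plain string replacement stands in for the generative "Seek" models, and the names `PII_PATTERNS`, `anonymize`, and `deanonymize` are hypothetical.

```python
import re

# Assumption: two simple regexes stand in for a trained NER model.
PII_PATTERNS = {
    "EMAIL": re.compile(r"[\w.+-]+@[\w-]+\.[\w.-]+"),
    "PHONE": re.compile(r"\+?\d[\d\s-]{7,}\d"),
}


def anonymize(text):
    """Replace detected PII with numbered placeholders and return the mapping."""
    mapping = {}  # placeholder -> original value
    seen = {}     # original value -> placeholder, so repeats reuse one placeholder
    for label, pattern in PII_PATTERNS.items():
        count = 0

        def substitute(match):
            nonlocal count
            value = match.group(0)
            if value not in seen:
                count += 1
                seen[value] = f"[{label}_{count}]"
                mapping[seen[value]] = value
            return seen[value]

        text = pattern.sub(substitute, text)
    return text, mapping


def deanonymize(text, mapping):
    """Put the original values back wherever a placeholder survived."""
    for placeholder, original in mapping.items():
        text = text.replace(placeholder, original)
    return text


prompt = "Email alice@example.com or call +46 70 123 45 67."
masked, mapping = anonymize(prompt)

# Hypothetical model reply that reuses a placeholder from the masked prompt.
response = "I will contact [EMAIL_1] first."
restored = deanonymize(response, mapping)
```

Exact string replacement only works when the model echoes placeholders unchanged. A reply that rephrases or inflects a placeholder would not be restored by this sketch.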

🔗 Check out the full thesis for a great deep-dive into building a practical, end-to-end privacy solution: https://www.diva-portal.org/smash/get/diva2:1980696/FULLTEXT01.pdf

#OpenSource
#DataPrivacy
#LLM
#Anonymization
#AIsecurity
#HuggingFace
#Ai4Privacy
#Worldslargestopensourceprivacymaskingdataset
Reubencf 
posted an update 3 days ago
As 2025 is ending, I would like to thank everyone for trying out Reubencf/Nano_Banana_Editor.

I look forward to building and releasing more in the future for the open-source community.

projectlosangeles 
posted an update 2 days ago
di-zhang-fdu 
posted an update 2 days ago
dhruv3006 
posted an update 3 days ago
Git is powerful, but it’s also one of the biggest sources of developer mistakes.

What is Git GUI, and how does it help here?

Git GUI makes version control visual, predictable, and easier to reason about, especially when things go wrong.

That’s exactly why we built Git GUI in Voiden.

Instead of relying on memorized commands, Voiden lets you see what Git is doing before it does it.

What Voiden’s Git GUI helps developers do
• View exact file and line-level changes before committing
• Stage only intended changes (no accidental commits)
• Clearly distinguish staged vs unstaged files
• Inspect visual diffs with full context
• Understand branches, commit history, and repo state instantly

When Git behavior is hidden, errors increase. Voiden’s Git GUI doesn’t abstract Git away; it explains Git.

Whether you’re new to Git or an experienced developer who prefers clarity, this is Git you can reason about.

Version control should feel safe, not stressful.

What Git pain points slow you down today?

Try out Git GUI in beta: https://voiden.md (now on Linux and Mac)

MonsterMMORPG 
posted an update 3 days ago
How to Use SwarmUI Presets & Workflows in ComfyUI + Custom Model Paths Setup for ComfyUI & SwarmUI : https://www.youtube.com/watch?v=EqFilBM3i7s

Info

Generating a workflow inside SwarmUI and using it in ComfyUI takes literally one click. In this tutorial I will show you the easiest way to use our 40+ generative AI presets made for SwarmUI in ComfyUI. You will be able to get the best results from AI models such as SDXL, FLUX, Z Image Turbo, Wan 2.1, Wan 2.2, FLUX 2, Qwen Image, Qwen Image Edit, and FLUX Kontext, and from tasks such as image outpainting and image inpainting, among many others. I will also show how to use custom model paths in ComfyUI and SwarmUI to keep your models in one shared folder, which avoids model duplication and saves a large amount of disk space.

mike-ravkine 
posted an update about 3 hours ago
Happy 2026 everyone!

I've been busy working on some new ranking/position methodologies and am excited to start sharing some results.

Plot legends:

- X = truncation rate (low = good)
- ? = confusion rate (low = good)
- blue bars = average completion tokens (low = good)
- black diamonds = CI-banded performance (high = good)
- cluster squares = models inside this group are equivalent

openai/gpt-oss-120b remains the king in all dimensions of interest: truncation rates, completion lengths, and performance. If I had but one complaint, it's that reason_effort does not seem to actually work - more on this soon.

Second is a 3-way tie in performance between the Qwen3-235B-2507 we all know and love and an unexpected entrant: ByteDance-Seed/Seed-OSS-36B-Instruct.

This is a very capable model and its reasoning effort controls actually work, but you should absolutely not leave it on the default "unlimited": enable a sensible limit (4k works well for 8k context length).

Third place is another 3-way tie, this one between Seed-OSS-36B (it straddles the CI boundary between 2nd and 3rd place), Qwen/Qwen3-Next-80B-A3B-Instruct (demonstrating that full attention may be overrated after all and gated is the way to go), and the newly released zai-org/GLM-4.7, which offers excellent across-the-board performance with some of the shortest reasoning traces I've seen so far.
unmodeled-tyler 
posted an update about 23 hours ago
NEW MODEL ALERT: vanta-research/atom-27b

Atom-27B has arrived! This model is the largest open-weight model so far from VANTA Research, and is our 4th model in Project Atom - an effort to scale our collaborative Atom persona from 4B to 400B+.

Atom-27B is based on Google's Gemma 3 27B architecture, and embodies the familiar friendly, warm, and curious persona that appeared in previous releases.

Atom is designed to think WITH you, not FOR you - marking VANTA Research's commitment to building frontier collaborative models.

Check it out!
danielhanchen 
posted an update 1 day ago
dhruv3006 
posted an update 1 day ago
API testing needs a reset.

API testing is broken. You test localhost but your collections live in someone's cloud. Your docs are in Notion. Your tests are in Postman. Your code is in Git. Nothing talks to each other.

So we built a solution.

The Stack:

- Format: Pure Markdown (APIs should be documented, not locked)
- Storage: Git-native (Your API tests version with your code)
- Validation: OpenAPI schema validation (types, constraints, composition), run automatically on every response
- Workflow: Offline-first, CLI + GUI (No cloud required for localhost)
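To make the validation item in the stack above concrete, here is a minimal sketch of checking a response body against an OpenAPI-style schema for types, constraints, and composition. It is an illustration under stated assumptions, not Voiden's implementation: the `validate` function, `TYPE_MAP`, and the `USER_SCHEMA` example are hypothetical, and only a small subset of schema keywords (`type`, `minimum`, `maxLength`, `required`, `properties`, `items`, `allOf`) is handled.

```python
# Assumption: a tiny subset of OpenAPI schema keywords, for illustration only.
TYPE_MAP = {
    "string": str,
    "integer": int,
    "number": (int, float),
    "boolean": bool,
    "object": dict,
    "array": list,
}


def validate(value, schema, path="$"):
    """Return a list of error strings; an empty list means the value conforms."""
    errors = []

    # Types: stop early on a mismatch, since deeper checks would be meaningless.
    expected = schema.get("type")
    if expected is not None:
        # bool is a subclass of int in Python, so reject it for numeric types.
        bool_as_number = isinstance(value, bool) and expected in ("integer", "number")
        if bool_as_number or not isinstance(value, TYPE_MAP[expected]):
            return [f"{path}: expected {expected}, got {type(value).__name__}"]

    # Constraints: numeric lower bound and string length.
    if "minimum" in schema and isinstance(value, (int, float)):
        if value < schema["minimum"]:
            errors.append(f"{path}: {value} is below minimum {schema['minimum']}")
    if "maxLength" in schema and isinstance(value, str):
        if len(value) > schema["maxLength"]:
            errors.append(f"{path}: length {len(value)} exceeds maxLength {schema['maxLength']}")

    # Objects: required properties, then recurse into declared properties.
    if isinstance(value, dict):
        for name in schema.get("required", []):
            if name not in value:
                errors.append(f"{path}.{name}: required property is missing")
        for name, sub_schema in schema.get("properties", {}).items():
            if name in value:
                errors.extend(validate(value[name], sub_schema, f"{path}.{name}"))

    # Arrays: every element must satisfy the item schema.
    if isinstance(value, list) and "items" in schema:
        for index, item in enumerate(value):
            errors.extend(validate(item, schema["items"], f"{path}[{index}]"))

    # Composition: the value must satisfy every sub-schema listed in allOf.
    for sub_schema in schema.get("allOf", []):
        errors.extend(validate(value, sub_schema, path))

    return errors


# Hypothetical schema composed from two parts with allOf.
USER_SCHEMA = {
    "allOf": [
        {
            "type": "object",
            "required": ["id"],
            "properties": {"id": {"type": "integer", "minimum": 1}},
        },
        {
            "type": "object",
            "properties": {"name": {"type": "string", "maxLength": 5}},
        },
    ]
}

ok_errors = validate({"id": 3, "name": "Ada"}, USER_SCHEMA)
bad_errors = validate({"id": 0, "name": "Adalovelace"}, USER_SCHEMA)
```

Returning a list of path-tagged errors instead of raising on the first failure lets a test runner report every violation in a response at once.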

Try it out here: https://voiden.md/