GLiNER2: Unified Schema-Based Information Extraction

adsharma · 2026-03-06T04:22:28 1772770948

Feels like it's written by ML people not following python software engineering practices.

No black, UV or ruff.

Prints messages with emojis to stdout by default.

Makes a connection to hugging face on every import.

https://github.com/fastino-ai/GLiNER2/pull/74

fbilhaut · 2026-03-06T10:48:13 1772794093

GLiNER is a really great research work. But putting this kind of things in production is just another job. Not trying to do self promotion here, but there are alternatives for this purpose, like gline-rs (https://github.com/fbilhaut/gline-rs). Support of GLiNER 2 models is on the way.

adsharma · 2026-03-06T15:00:14 1772809214

Any chance you could wrap this in pyo3? There is a large python market for this.

iwhalen · 2026-03-05T22:46:30 1772750790

Very cool stuff. Love the focus on CPU-first.

Would also love to see some throughput numbers on basic VM setup.

Edit: there are some latency numbers in the paper https://arxiv.org/pdf/2507.18546

deepsquirrelnet · 2026-03-05T22:45:29 1772750729

Zero-shot encoder models are so cool. I'll definitely be checking this out.

If you're looking for a zero-shot classifier, tasksource is in a similar vein.

https://huggingface.co/tasksource/ModernBERT-large-nli

plaguna · 2026-03-06T06:30:10 1772778610

Is this only for text I guess? What if the documents are in PDF? What is the recommendation to transform PDF to text?

akreal · 2026-03-06T11:25:24 1772796324

Docling: https://github.com/docling-project/docling

snthpy · 2026-03-06T05:57:29 1772776649

This looks great. Thank you!

hbcondo714 · 2026-03-05T22:13:42 1772748822

There is another version at:

https://github.com/urchade/GLiNER

Looks like it’s still being maintained too?

adsharma · 2026-03-06T04:19:47 1772770787

Use Gliner2. Much better model.