Flax Community

non-profit

https://github.com/huggingface/transformers/tree/master/examples/research_projects/jax-projects

AI & ML interests

JAX, Flax, TPU, 🤗

authored a paper 15 days ago

Beyond LLM-as-a-Judge: Deterministic Metrics for Multilingual Generative Text Evaluation

Paper • 2604.05083 • Published 24 days ago

authored 4 papers 28 days ago

Contrastive Representation Learning: A Framework and Review

Paper • 2010.05113 • Published Oct 10, 2020 • 1

NeurIPS 2025 E2LM Competition: Early Training Evaluation of Language Models

Paper • 2506.07731 • Published Jun 9, 2025 • 2

Falcon-H1: A Family of Hybrid-Head Language Models Redefining Efficiency and Performance

Paper • 2507.22448 • Published Jul 30, 2025 • 71

Falcon Perception

Paper • 2603.27365 • Published Mar 28 • 14

submitted a paper to Daily Papers about 1 month ago

Composer 2 Technical Report

Paper • 2603.24477 • Published Mar 25 • 15

authored 2 papers about 1 month ago

Fanar-Sadiq: A Multi-Agent Architecture for Grounded Islamic QA

Paper • 2603.08501 • Published Mar 9

What Really Controls Temporal Reasoning in Large Language Models: Tokenisation or Representation of Time?

Paper • 2603.19017 • Published Mar 19 • 3

submitted 2 papers to Daily Papers about 1 month ago

What Really Controls Temporal Reasoning in Large Language Models: Tokenisation or Representation of Time?

Paper • 2603.19017 • Published Mar 19 • 3

Fanar-Sadiq: A Multi-Agent Architecture for Grounded Islamic QA

Paper • 2603.08501 • Published Mar 9

authored 2 papers 4 months ago

From RAG to Agentic RAG for Faithful Islamic Question Answering

Paper • 2601.07528 • Published Jan 12 • 4

Prototypicality Bias Reveals Blindspots in Multimodal Evaluation Metrics

Paper • 2601.04946 • Published Jan 8

authored 2 papers 5 months ago

On Space Folds of ReLU Neural Networks

Paper • 2502.09954 • Published Feb 14, 2025

The Space Between: On Folding, Symmetries and Sampling

Paper • 2503.08502 • Published Mar 11, 2025

authored a paper 7 months ago

Robot Learning: A Tutorial

Paper • 2510.12403 • Published Oct 14, 2025 • 130

authored 2 papers 7 months ago

MixtureVitae: Open Web-Scale Pretraining Dataset With High Quality Instruction and Reasoning Data Built from Permissive-First Text Sources

Paper • 2509.25531 • Published Sep 29, 2025 • 10

BigCodeArena: Unveiling More Reliable Human Preferences in Code Generation via Execution

Paper • 2510.08697 • Published Oct 9, 2025 • 39

authored a paper 7 months ago

Distributional Semantics Tracing: A Framework for Explaining Hallucinations in Large Language Models

Paper • 2510.06107 • Published Oct 7, 2025 • 3

in flax-community/roberta-base-mr 9 months ago

Adding `safetensors` variant of this model

#1 opened about 1 year ago by

authored a paper 10 months ago

A Multi-Task Benchmark for Abusive Language Detection in Low-Resource Settings

Paper • 2505.12116 • Published May 17, 2025