🇪🇺 EU AI Act: Systemic Risks in the First CoP Draft Comments 🇪🇺 Dec 12, 2024 • 12
EU Training Data Transparency: A Proposal for a Sufficiently Detailed Summary 🇪🇺 Jul 3, 2024 • 8
Ethics and Society Newsletter #6: Building Better AI: The Importance of Data Quality Jun 24, 2024 • 33
Policy Questions Blog 1: AI Data Transparency Remarks for NAIAC Panel Mar 27, 2024 • 2
Training Data Transparency in AI: Tools, Trends, and Policy Recommendations Dec 5, 2023 • 1
Introducing IDEFICS: An Open Reproduction of State-of-the-art Visual Language Model Aug 22, 2023 • 28
ZeroGPU Spaces Collection ZeroGPU Spaces made by the community • 17 items • Updated Jun 6, 2024 • 233
Frugal AI Challenge Tasks Collection Find the 3 datasets for the Frugal AI Challenge in this Collection! Find all the details of the challenge at https://frugalaichallenge.org/ • 7 items • Updated 9 days ago • 14
ModernBERT Collection Bringing BERT into modernity via both architecture changes and scaling • 3 items • Updated 27 days ago • 123
Article Finding Moroccan Arabic (Darija) in Fineweb 2 By omarkamali • Dec 8, 2024 • 21
Article 🇪🇺 EU AI Act: Systemic Risks in the First CoP Draft Comments 🇪🇺 By yjernite • Dec 12, 2024 • 12
OLMo 2 Collection Artifacts for the second set of OLMo models. • 22 items • Updated 9 days ago • 74
Article Let's make a generation of amazing image generation models By burtenshaw • Nov 26, 2024 • 34
Article Releasing the largest multilingual open pretraining dataset By Pclanglais • Nov 13, 2024 • 98
SmolLM2 Collection State-of-the-art compact LLMs for on-device applications: 1.7B, 360M, 135M • 15 items • Updated 24 days ago • 199
2024 Interconnects Artifacts Collection Models & datasets mentioned in the bottom section of posts! • 280 items • Updated 13 days ago • 6
FLAIR models: landcover semantic segmentation Collection The FLAIR models are a collection of semantic segmentation models initially developed to classify land cover on very high resolution aerial imagery. • 9 items • Updated Jun 19, 2024 • 12
Pangea Collection A Fully Open Multilingual Multimodal LLM for 39 Languages • 18 items • Updated Nov 2, 2024 • 18
Article Democratization of AI, Open Source, and AI Auditing: Thoughts from the DisinfoCon Panel in Berlin By frimelle • Oct 8, 2024 • 6
Moshi v0.1 Release Collection MLX, Candle & PyTorch model checkpoints released as part of the Moshi release from Kyutai. Run inference via: https://github.com/kyutai-labs/moshi • 13 items • Updated Sep 18, 2024 • 225
Article Getty Images Brings High-Quality, Commercially Safe Dataset to Hugging Face By andreagagliano • Sep 6, 2024 • 16
Qwen2-VL Collection Vision-language model series based on Qwen2 • 16 items • Updated Dec 6, 2024 • 190