GRAG-LLAMA-3.1-8B (German Retrieval Augmented Generation)
This collection contains all final checkpoints and datasets from training Meta's Llama-3.1-8B model on the GRAG datasets.
Question Answering • Updated • 22
Note: This model was merged from the SFT and ORPO checkpoints, with the SFT model weighted at 60% and the ORPO model at 40%.
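The 60/40 weighting in the note above can be pictured as a weighted average of the two checkpoints' parameters. The note does not state which merge method was used, so the sketch below assumes a plain linear (element-wise) merge. The function name `linear_merge`, the toy parameter names, and the use of Python lists in place of real tensors are all hypothetical and chosen only for illustration.

```python
from typing import Dict, List

# Toy stand-in for a checkpoint: parameter name -> flat list of values.
StateDict = Dict[str, List[float]]


def linear_merge(sft: StateDict, orpo: StateDict, sft_weight: float = 0.6) -> StateDict:
    """Return the element-wise weighted average of two checkpoints.

    Assumption: a simple linear merge; the actual GRAG merge method
    is not documented in this listing.
    """
    if sft.keys() != orpo.keys():
        raise ValueError("checkpoints must contain the same parameter names")

    orpo_weight = 1.0 - sft_weight  # 0.4 when sft_weight is 0.6
    merged: StateDict = {}
    for name, sft_values in sft.items():
        orpo_values = orpo[name]
        if len(sft_values) != len(orpo_values):
            raise ValueError(f"shape mismatch for parameter {name!r}")
        merged[name] = [
            sft_weight * s + orpo_weight * o
            for s, o in zip(sft_values, orpo_values)
        ]
    return merged


# Hypothetical toy parameters, not real model weights.
sft_toy = {"layer.weight": [1.0, 2.0], "layer.bias": [0.0]}
orpo_toy = {"layer.weight": [2.0, 4.0], "layer.bias": [1.0]}
merged_toy = linear_merge(sft_toy, orpo_toy)
```

With real checkpoints the same idea would apply per tensor rather than per list, and both models must share an identical architecture so that every parameter has a counterpart.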
avemio/GRAG-LLAMA-3.1-8B-ORPO-HESSIAN-AI
Question Answering • Updated • 108 • 2
Note: This model was trained on 20.7 million tokens with ORPO (Odds Ratio Preference Optimization) on synthetically generated or enhanced data. Please see the GRAG-ORPO-Dataset (https://huggingface.co/datasets/avemio/GRAG-ORPO-ShareGPT-HESSIAN-AI) for reference.
avemio/GRAG-LLAMA-3.1-8B-SFT-HESSIAN-AI
Question Answering • Updated • 87
Note: This model was trained on 1.5 billion tokens with SFT (Supervised Fine-Tuning) on synthetically generated or enhanced data. Please see the GRAG-SFT-Dataset (https://huggingface.co/datasets/avemio/GRAG-SFT-ShareGPT-HESSIAN-AI) for reference.
avemio/GRAG-LLAMA-3.1-8B-CPT-HESSIAN-AI
Question Answering • Updated
Note: This model was trained on 507.5 million tokens with CPT (Continued Pre-Training) on synthetically generated or enhanced data. Please see the GRAG-CPT-Dataset (https://huggingface.co/datasets/avemio/GRAG-CPT-HESSIAN-AI) for reference.
avemio/GRAG-ORPO-ShareGPT-HESSIAN-AI
Viewer • Updated • 13.7k • 2
avemio/GRAG-SFT-ShareGPT-HESSIAN-AI
Viewer • Updated • 1.01M • 2 • 1
avemio/GRAG-CPT-HESSIAN-AI
Viewer • Updated • 654k • 2
avemio/GRAG-LLAMA-3.1-8B-MERGED-HESSIAN-AI-Q8_0-GGUF
Question Answering • Updated • 19
avemio/GRAG-LLAMA-3.1-8B-ORPO-HESSIAN-AI-Q8_0-GGUF
Question Answering • Updated • 23
avemio/GRAG-LLAMA-3.1-8B-SFT-HESSIAN-AI-Q8_0-GGUF
Question Answering • Updated • 21