Code and documents of LongLoRA and LongAlpaca (ICLR 2024 Oral)
Official release of the InternLM2 7B and 20B base and chat models, with 200K context support.
This hands-on guide walks you through fine-tuning an open-source LLM on Azure and serving the fine-tuned model there. It is intended for data scientists and ML engineers who have fine-tuning experience but are unfamiliar with Azure ML.
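As a rough illustration of the workflow such a guide covers, here is a minimal sketch of submitting a fine-tuning script as an Azure ML command job with the Python SDK v2; the workspace details, compute name, environment reference, and train.py are placeholders, not the guide's actual code.

```python
# Minimal sketch: submit a fine-tuning script as an Azure ML command job
# (Azure ML Python SDK v2). All names below are placeholders.
from azure.ai.ml import MLClient, command
from azure.identity import DefaultAzureCredential

ml_client = MLClient(
    credential=DefaultAzureCredential(),
    subscription_id="<subscription-id>",      # placeholder
    resource_group_name="<resource-group>",   # placeholder
    workspace_name="<workspace>",             # placeholder
)

job = command(
    code="./src",                             # folder containing train.py
    command="python train.py --epochs 3",     # hypothetical training script
    environment="azureml:<your-environment>@latest",  # placeholder env reference
    compute="gpu-cluster",                    # name of an existing GPU compute
    display_name="llm-finetune-demo",
)

returned_job = ml_client.jobs.create_or_update(job)
print(returned_job.studio_url)  # link to monitor the run in Azure ML studio
```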
Develop a Romanian legal-domain Large Language Model (LLM) by taking a pre-trained model and fine-tuning it on legal texts. The fine-tuned model is available on Hugging Face.
This repo collects resources and code covering transformers and NLP.
LegalDigest - NLP Project
Fine-tuning Mistral LLM for Adaptive Machine Translation
Comprehensive Compilation of Customized LLMs for Specific Domains and Industries
This repository implements a self-updating RAG (Retrieval-Augmented Generation) model. It leverages Wikipedia for factual grounding and can fine-tune itself when information is unavailable, allowing the model to continually learn and adapt while offering dynamic, informative responses.
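A minimal sketch of the retrieve-then-generate loop this description implies, assuming the `wikipedia` package and a caller-supplied `generate()` function; it is not the repository's actual implementation:

```python
# Sketch: ground an answer in a Wikipedia summary before generating.
# `generate` is a hypothetical caller-supplied text-generation function.
import wikipedia

def answer_with_grounding(question: str, generate) -> str:
    try:
        # Retrieve a short factual passage to ground the answer.
        context = wikipedia.summary(question, sentences=3)
    except wikipedia.exceptions.WikipediaException:
        # No grounding found: the described system would instead trigger
        # self-fine-tuning; here we simply fall back to the bare model.
        context = ""
    prompt = f"Context: {context}\n\nQuestion: {question}\nAnswer:"
    return generate(prompt)
```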
AM (Advanced Mathematics) Chat is a large language model that integrates advanced mathematical knowledge with exercises in higher mathematics and their solutions.
[ACL2024 Findings] Agent-FLAN: Designing Data and Methods of Effective Agent Tuning for Large Language Models
Pre-training and fine-tuning transformer models using PyTorch and the Hugging Face Transformers library. Whether you're pre-training on custom datasets or fine-tuning for specific classification tasks, these notebooks offer explanations and implementation code.
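For flavor, here is a minimal fine-tuning sketch with the Hugging Face Trainer API; the model, dataset, and hyperparameters are illustrative assumptions, not the notebooks' own choices:

```python
# Sketch: fine-tune a small transformer for binary classification
# with the Hugging Face Trainer API.
from datasets import load_dataset
from transformers import (AutoModelForSequenceClassification, AutoTokenizer,
                          Trainer, TrainingArguments)

model_name = "distilbert-base-uncased"  # illustrative choice
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForSequenceClassification.from_pretrained(model_name, num_labels=2)

dataset = load_dataset("imdb")  # illustrative dataset

def tokenize(batch):
    return tokenizer(batch["text"], truncation=True, padding="max_length")

tokenized = dataset.map(tokenize, batched=True)

trainer = Trainer(
    model=model,
    args=TrainingArguments(output_dir="out", num_train_epochs=1,
                           per_device_train_batch_size=8),
    train_dataset=tokenized["train"].shuffle(seed=42).select(range(2000)),
    eval_dataset=tokenized["test"].select(range(500)),
)
trainer.train()
```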
Stumble upon a fine-tuning that is unfathomable.
Tune an LLM in a few lines of code.
Fine-tuning and evaluating a Falcon 7B model for generating HTML code from input prompts.
Fine-tune ChatGPT with few-shot learning for personalized resume bullet points.
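A hedged sketch of what launching such a fine-tuning job looks like with the OpenAI Python client; the file name and base model are assumptions, not taken from this project:

```python
# Sketch: upload chat-formatted examples (e.g. resume bullet rewrites)
# and start an OpenAI fine-tuning job.
from openai import OpenAI

client = OpenAI()  # reads OPENAI_API_KEY from the environment

# training.jsonl: one {"messages": [...]} chat example per line (assumed file)
uploaded = client.files.create(
    file=open("training.jsonl", "rb"),
    purpose="fine-tune",
)

job = client.fine_tuning.jobs.create(
    training_file=uploaded.id,
    model="gpt-3.5-turbo",  # an assumed fine-tunable base model
)
print(job.id, job.status)
```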
Exploring the potential of fine-tuning Large Language Models (LLMs) like Llama2 and StableLM for medical entity extraction. This project adapts these models using PEFT, Adapter V2, and LoRA techniques to efficiently and accurately extract drug names and adverse side effects from pharmaceutical texts.
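As a sketch of the LoRA-via-PEFT setup named here (not the project's actual code), assuming a Hugging Face causal LM and typical adapter hyperparameters:

```python
# Sketch: wrap a causal LM with a LoRA adapter using the PEFT library,
# so only the low-rank adapter weights are trained.
from peft import LoraConfig, get_peft_model
from transformers import AutoModelForCausalLM

base = AutoModelForCausalLM.from_pretrained("meta-llama/Llama-2-7b-hf")  # illustrative

lora_config = LoraConfig(
    r=8,                                   # low-rank dimension
    lora_alpha=16,                         # scaling factor
    target_modules=["q_proj", "v_proj"],   # attention projections to adapt
    lora_dropout=0.05,
    task_type="CAUSAL_LM",
)

model = get_peft_model(base, lora_config)
model.print_trainable_parameters()  # only the LoRA weights are trainable
```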
A data-centric AI package for ML/AI: curate high-quality data for the best results. Discord: https://discord.gg/t6ADqBKrdZ
MLX Institute | Fine-tuning Llama-2 7B on The Onion to generate new satirical articles given a headline