I am trying to fine tune a openllama model with huggingface's peft and lora. I fine tuned the model on a specific dataset. However, the output from the model.generate() is very poor for the given input. When I give a whole sentence form the dataset then it generates related texts, otherwise it is not. Are there any way to improve it?
How to improve the output of fine tuned Open Llama 7b model for text generation?
452 views Asked by Md Tahmid Hasan Fuad At
0
There are 0 answers
Related Questions in LARGE-LANGUAGE-MODEL
- Is it possible to fine tune or use RAG on the CoreML version of Llama2?
- Compare two strings by meaning using LLMs
- Implementation (and working) differences between AutoModelForCausalLMWithValueHead vs AutoModelForCausalLM?
- How do I know the right data format for different LLMs finetuning?
- I am trying to make a product which will reformat the answer using the question and Sql_answer as data
- CUDA OutOfMemoryError but free memory is always half of required memory in error message
- Query with my own data using langchain and pinecone
- Could not find a version that satisfies the requirement python-magic-bin
- Any possibility to increase performance of querying chromadb persisted locally
- Grid based decision making with Llama 2
Related Questions in FINE-TUNING
- loading saved model doesn't behave as expected when finetuning it
- Can I create a fine-tuned model for OpenAI API Codex models?
- Transfer learning (or fine-tuning) pre-trained model on non-text data
- Fine tuning a BERT Model as a chatbot giving error while training
- I have to finetune the below query in Postgres its taking time for fetching the data, can you help Me?
- Do I need to retrain Bert for NER to create new labels?
- How to use GPU for Fine-tuning HuggingSound custom model
- I am attempting to fine-tune the stable diffusion with Dreambooth on myself (my face and body)
- Is validation set necessary when fine-tuning a model using synthetic images?
- Can i clear up gpu vram in colab
Related Questions in LLAMA-INDEX
- Python OpenAI API: Can't instantiate abstract class CustomExtractor with abstract method class_name
- Exceeding LLM's maximum context length even using llama_index PromptHelper
- llama_index PromptHelper not chunking properly
- asyncio.create_task blocks main thread
- Is there a way to use llama-index only on the indexing side?
- How to improve accuracy of the large Milvus Index?
- chatbot that will generate a document draft with python, langchain, and openai
- Mongodb Vector search with llama_index.llms.openai_utils: APIConnectionError: Connection error
- PandasQueryEngine from llama-index is unable to execute code with the following error: invalid syntax (, line 0)
- llama-index: multiple calls to query_engine.query always gives "Empty Response"
Related Questions in PEFT
- perform peft with lora on flan-t5 model causing no executable batch size error
- PEFT LoRA Trainer No executable batch size found
- CUDA out of memory error during PEFT LoRA fine tuning
- Loading a pretrained model is not working, what could be the issue?
- Huggingface peft error message AttributeError: 'Linear8bitLt' object has no attribute 'state'
- I want to merge my PEFT adapter model with the base model and make a fully new model
- safetensors_rust.SafetensorError: Error while deserializing header: HeaderTooLarge
- AutoTrain advanced CLI: error: unrecognized arguments: --fp16 --use-int4
- RuntimeError: "addmm_impl_cpu_" not implemented for 'Half' - PEFT Huggingface trying to run on CPU
- How to improve the output of fine tuned Open Llama 7b model for text generation?
Popular Questions
- How do I undo the most recent local commits in Git?
- How can I remove a specific item from an array in JavaScript?
- How do I delete a Git branch locally and remotely?
- Find all files containing a specific text (string) on Linux?
- How do I revert a Git repository to a previous commit?
- How do I create an HTML button that acts like a link?
- How do I check out a remote Git branch?
- How do I force "git pull" to overwrite local files?
- How do I list all files of a directory?
- How to check whether a string contains a substring in JavaScript?
- How do I redirect to another webpage?
- How can I iterate over rows in a Pandas DataFrame?
- How do I convert a String to an int in Java?
- Does Python have a string 'contains' substring method?
- How do I check if a string contains a specific word?
Popular Tags
Trending Questions
- UIImageView Frame Doesn't Reflect Constraints
- Is it possible to use adb commands to click on a view by finding its ID?
- How to create a new web character symbol recognizable by html/javascript?
- Why isn't my CSS3 animation smooth in Google Chrome (but very smooth on other browsers)?
- Heap Gives Page Fault
- Connect ffmpeg to Visual Studio 2008
- Both Object- and ValueAnimator jumps when Duration is set above API LvL 24
- How to avoid default initialization of objects in std::vector?
- second argument of the command line arguments in a format other than char** argv or char* argv[]
- How to improve efficiency of algorithm which generates next lexicographic permutation?
- Navigating to the another actvity app getting crash in android
- How to read the particular message format in android and store in sqlite database?
- Resetting inventory status after order is cancelled
- Efficiently compute powers of X in SSE/AVX
- Insert into an external database using ajax and php : POST 500 (Internal Server Error)