Finetune documentation
Finetune is a Python library designed to make finetuning pre-trained language models for custom natural language processing tasks a breeze.
It ships with pre-trained model weights from “Improving Language Understanding by Generative Pre-Training” and builds off the OpenAI/finetune-language-model repository.
Source code for Finetune is available on GitHub.
Installation
Finetune can be installed directly from PyPI using pip:
pip install finetune
or installed directly from source:
git clone https://github.com/IndicoDataSolutions/finetune
cd finetune
python3 setup.py develop
You can optionally run the provided test suite to confirm that installation completed successfully:
nosetests
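When running the full test suite is more than you need, a lighter sanity check is to confirm that the package can be found on the import path. The following is a minimal sketch using only the standard library; the helper name is_installed is hypothetical and is not part of Finetune.

```python
import importlib.util


def is_installed(package_name):
    """Return True if a top-level package can be located on the import path.

    find_spec only locates the package; it does not import or execute it.
    """
    return importlib.util.find_spec(package_name) is not None


if is_installed("finetune"):
    print("finetune is importable")
else:
    print("finetune was not found; re-run the installation step")
```

This only shows that Python can locate the package. It does not verify that the pre-trained weights or other dependencies load correctly, which the test suite covers.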
Finetune Quickstart Guide
Finetuning the base language model is as easy as calling LanguageModelClassifier.fit():
from finetune import LanguageModelClassifier # assumes the class is exported from the top-level package

model = LanguageModelClassifier() # load base model
model.fit(trainX, trainY) # finetune base model on custom data
predictions = model.predict(testX) # predict on unseen examples
model.save(path) # serialize the model to disk
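The quickstart takes trainX, trainY and testX as given. Here is a minimal sketch of one way to prepare them, assuming the inputs are a list of strings and the targets are a parallel list of labels. That shape is an assumption, so check it against the API reference. The helper train_test_split and the sample data are hypothetical and are not part of Finetune.

```python
import random


def train_test_split(texts, labels, test_fraction=0.2, seed=0):
    """Shuffle paired examples and split them into train and test portions."""
    if len(texts) != len(labels):
        raise ValueError("texts and labels must have the same length")
    indices = list(range(len(texts)))
    # A seeded generator keeps the split reproducible between runs.
    random.Random(seed).shuffle(indices)
    # Hold out at least one example for testing.
    n_test = max(1, int(len(indices) * test_fraction))
    test_idx, train_idx = indices[:n_test], indices[n_test:]
    trainX = [texts[i] for i in train_idx]
    trainY = [labels[i] for i in train_idx]
    testX = [texts[i] for i in test_idx]
    testY = [labels[i] for i in test_idx]
    return trainX, trainY, testX, testY


# Illustrative data only; a real task needs far more examples.
texts = [
    "The battery lasts all day.",
    "The screen cracked within a week.",
    "Setup took less than five minutes.",
    "Support never answered my emails.",
]
labels = ["positive", "negative", "positive", "negative"]

trainX, trainY, testX, testY = train_test_split(texts, labels, test_fraction=0.25)
```

Keeping testY alongside testX makes it possible to compare the model's predictions against held-out labels afterwards.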
Saved models can be reloaded from disk with LanguageModelClassifier.load():
model = LanguageModelClassifier.load(path)
predictions = model.predict(testX)