Instructions to use OpenSafetyLab/MD-Judge-v0.1 with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use OpenSafetyLab/MD-Judge-v0.1 with Transformers:
# Use a pipeline as a high-level helper from transformers import pipeline pipe = pipeline("text-generation", model="OpenSafetyLab/MD-Judge-v0.1")# Load model directly from transformers import AutoTokenizer, AutoModelForCausalLM tokenizer = AutoTokenizer.from_pretrained("OpenSafetyLab/MD-Judge-v0.1") model = AutoModelForCausalLM.from_pretrained("OpenSafetyLab/MD-Judge-v0.1") - Inference
- Notebooks
- Google Colab
- Kaggle
- Local Apps
- vLLM
How to use OpenSafetyLab/MD-Judge-v0.1 with vLLM:
Install from pip and serve model
# Install vLLM from pip: pip install vllm # Start the vLLM server: vllm serve "OpenSafetyLab/MD-Judge-v0.1" # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:8000/v1/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "OpenSafetyLab/MD-Judge-v0.1", "prompt": "Once upon a time,", "max_tokens": 512, "temperature": 0.5 }'Use Docker
docker model run hf.co/OpenSafetyLab/MD-Judge-v0.1
- SGLang
How to use OpenSafetyLab/MD-Judge-v0.1 with SGLang:
Install from pip and serve model
# Install SGLang from pip: pip install sglang # Start the SGLang server: python3 -m sglang.launch_server \ --model-path "OpenSafetyLab/MD-Judge-v0.1" \ --host 0.0.0.0 \ --port 30000 # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:30000/v1/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "OpenSafetyLab/MD-Judge-v0.1", "prompt": "Once upon a time,", "max_tokens": 512, "temperature": 0.5 }'Use Docker images
docker run --gpus all \ --shm-size 32g \ -p 30000:30000 \ -v ~/.cache/huggingface:/root/.cache/huggingface \ --env "HF_TOKEN=<secret>" \ --ipc=host \ lmsysorg/sglang:latest \ python3 -m sglang.launch_server \ --model-path "OpenSafetyLab/MD-Judge-v0.1" \ --host 0.0.0.0 \ --port 30000 # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:30000/v1/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "OpenSafetyLab/MD-Judge-v0.1", "prompt": "Once upon a time,", "max_tokens": 512, "temperature": 0.5 }' - Docker Model Runner
How to use OpenSafetyLab/MD-Judge-v0.1 with Docker Model Runner:
docker model run hf.co/OpenSafetyLab/MD-Judge-v0.1
Is there a suitable method of fine tuning for multiple languages?
Thank you for publishing this great model! I would like to use this model to safety evaluation in multiple languages.
I would like to apply fine tuning, are there any tips or scripts for this?
e.g.
- Can I just replace the following from the uses description on this model page with other language and do the instruct tuning?
<BEGIN CONVERSATION
User: %s <= other language text
Agent: %s <= other language text
<END CONVERSATION>
Thanks for following our work! In response to your question, here are my suggestions:
- The vast majority of the data used for fine-tuning our model is in English, so if you have a multilingual use for it, using multilingual Q&A pairs to replace the
Conversationpart is necessary. - According to the fine-tuning philosophy of our evaluation model, the model needs to understand the
TASK INSTRUCTIONandSAFETY CATEGORYto better provide evaluations. And, the base model is predominantly in English, so using English forTASK INSTRUCTIONandSAFETY CATEGORYshould help the model understand it better. Perhaps I would suggest that you replace theConversationpart with the language you need, keep the rest in English, and then modify the instructions in the `TASK INSTRUCTION to help the model understand your multilingual needs.
Since we have no immediate plans to extend the model's multilingual capabilities, we are unable to provide more detailed training tips in the multilingual area.
Thanks.
Thanks for your reply! That's very helpful.
I will try it and share if I get interesting results.