DeepSeek-Coder: FIM doesn't work
Fill in the middle is performing very badly for me
prompt:
<|fim▁begin|>#!/usr/bin/env python3
# get openai completion API in python
<|fim▁hole|>
<|fim▁end|>
Response:
<aiParent�ianBaidC < <Au7AgentX 9 <pK9pKpf8+25yK6Y1ZbK+nSdS9b3vb/K1R2t9f47XaZ8NV3c2vDZf2eH3f781/7e2mznFc+e9zfEeV/e5N3v
However, without FIM tokens:
#!/usr/bin/env python3
# get openai completion
import requests
def main():
API_KEY = "YOUR OPENAI API KEY"
headers = {'Authorization': f'Bearer {API_KEY}'}
data = {
'model': 'text-davinci-002',
'prompt': 'The current temperature in Stockholm is ...',
'max_tokens': 9,
'temperature': 0.1,
'top_p': 1,
'frequency_penalty': 0,
'presence_penalty': 0,
'stop': ['\n','\r']
This is just an example, it doesn’t work for large code either, neither for the 1.3B, 6.7B nor 33B I also already checked, the special tokens get encoded right Am I doing something wrong?
About this issue
- Original URL
- State: closed
- Created 7 months ago
- Comments: 15
You are welcome!