DeepSeek-Coder: FIM doesn't work

Fill-in-the-middle (FIM) is performing very badly for me.

Prompt:

<|fim▁begin|>#!/usr/bin/env python3
# get openai completion API in python
<|fim▁hole|>
<|fim▁end|>

Response:

                                                                           <aiParent�ianBaidC                                            <                                                              <Au7AgentX                                                                      9   <pK9pKpf8+25yK6Y1ZbK+nSdS9b3vb/K1R2t9f47XaZ8NV3c2vDZf2eH3f781/7e2mznFc+e9zfEeV/e5N3v

However, without FIM tokens:

#!/usr/bin/env python3
# get openai completion
import requests

def main():
    API_KEY = "YOUR OPENAI API KEY"
    headers = {'Authorization': f'Bearer {API_KEY}'}
    data = {
        'model': 'text-davinci-002',
        'prompt': 'The current temperature in Stockholm is ...',
        'max_tokens': 9,
        'temperature': 0.1,
        'top_p': 1,
        'frequency_penalty': 0,
        'presence_penalty': 0,
        'stop': ['\n','\r']

This is just an example; it fails on larger code too, and with the 1.3B, 6.7B, and 33B models alike. I have also already checked that the special tokens are encoded correctly. Am I doing something wrong?
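
For reference, the token-encoding check can be done with a minimal sketch like the following, using the Hugging Face tokenizer (the checkpoint name is an assumption; each FIM marker should come back as a single token id):

from transformers import AutoTokenizer

tok = AutoTokenizer.from_pretrained("deepseek-ai/deepseek-coder-6.7b-base")  # assumed checkpoint
for marker in ["<|fim▁begin|>", "<|fim▁hole|>", "<|fim▁end|>"]:
    ids = tok(marker, add_special_tokens=False).input_ids
    # A marker the tokenizer knows about should encode to exactly one id.
    print(marker, ids, tok.convert_ids_to_tokens(ids))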

About this issue

  • State: closed
  • Created 7 months ago
  • Comments: 15

Most upvoted comments

  1. You need to add the BOS token for the base models (see the sketch at the end of this page).
  2. No, we do not fine-tune on FIM tasks. However, the instruction model can still work for FIM.

You are welcome!
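
For anyone who hits the same problem, a minimal sketch of the fix from comment 1 might look like this, using the Hugging Face transformers API (the checkpoint name is an assumption, and since whether the tokenizer adds the BOS token automatically depends on its configuration, it is prepended explicitly here):

import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_name = "deepseek-ai/deepseek-coder-1.3b-base"  # assumed base checkpoint
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForCausalLM.from_pretrained(model_name)

# The FIM prompt from the issue, verbatim.
prompt = (
    "<|fim▁begin|>#!/usr/bin/env python3\n"
    "# get openai completion API in python\n"
    "<|fim▁hole|>\n"
    "<|fim▁end|>"
)

# Encode without automatic special tokens, then prepend the BOS id explicitly.
ids = tokenizer(prompt, add_special_tokens=False).input_ids
ids = [tokenizer.bos_token_id] + ids

out = model.generate(torch.tensor([ids]), max_new_tokens=128, do_sample=False)
print(tokenizer.decode(out[0][len(ids):], skip_special_tokens=True))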