Prompt engineering for DeepSeek-R1.
temperature
within the range of 0.5-0.7 (0.6 is recommended) to prevent endless repetitions or incoherent outputs. Also, a top-p
of 0.95 is recommended.<think>
: On rare occasions, DeepSeek-R1 tends to bypass the thinking pattern, which can adversely affect the model’s performance. In this case, the response will not start with a <think>
tag. If you see this problem, try telling the model to start with the <think>
tag.