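Using the MindIE Native Interfaces
This section uses the text inference interface and the streaming inference interface as examples of how to call the interfaces. For the other interfaces, see the MindIE native interfaces section.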
Text Inference Interface
Interface name | Text inference interface |
---|---|
URL | https://{service IP address}:{port}/infer |
Request type | POST |
Request example | curl -H "Accept: application/json" -H "Content-type: application/json" --cacert ca.pem --cert client.pem --key client.key.pem -X POST -d '{ "inputs": "My name is Olivier and I", "stream": false, "parameters": { "temperature": 0.5, "top_k": 10, "top_p": 0.95, "max_new_tokens": 20, "do_sample": true, "seed": null, "repetition_penalty": 1.03, "details": true, "typical_p": 0.5, "watermark": false, "priority": 5, "timeout": 10 } }' https://127.0.0.1:1025/infer |
Response example | { "generated_text": "am a French native speaker. I am looking for a job in the hospitality industry. I", "details": { "finish_reason": "length", "generated_tokens": 20, "seed": 846930886 } } |
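The curl request above can also be issued from a script. The following is a minimal sketch using only the Python standard library, not an official MindIE client. The helper names `build_infer_request`, `make_tls_context`, and `infer` are hypothetical and chosen for illustration; the certificate file names, URL, and parameter names are taken from the curl example.

```python
import json
import ssl
import urllib.request


def build_infer_request(prompt, stream=False, **parameters):
    """Build the JSON body for POST /infer, mirroring the curl example.

    Hypothetical helper: keyword arguments are passed through unchanged
    as the "parameters" object (temperature, top_k, max_new_tokens, ...).
    """
    return {"inputs": prompt, "stream": stream, "parameters": parameters}


def make_tls_context(ca_file, cert_file, key_file):
    """Create a TLS context equivalent to curl's --cacert/--cert/--key."""
    context = ssl.create_default_context(cafile=ca_file)
    context.load_cert_chain(certfile=cert_file, keyfile=key_file)
    return context


def infer(url, body, context, timeout=30):
    """Send the request and return the decoded JSON response as a dict."""
    request = urllib.request.Request(
        url,
        data=json.dumps(body).encode("utf-8"),
        headers={
            "Accept": "application/json",
            "Content-Type": "application/json",
        },
        method="POST",
    )
    with urllib.request.urlopen(request, context=context, timeout=timeout) as resp:
        return json.loads(resp.read().decode("utf-8"))
```

Assuming the service and certificates from the curl example, a call might look like `infer("https://127.0.0.1:1025/infer", build_infer_request("My name is Olivier and I", max_new_tokens=20, details=True), make_tls_context("ca.pem", "client.pem", "client.key.pem"))`, after which the `generated_text` field of the returned dict holds the completion.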
Streaming Inference Interface
Interface name | Streaming inference interface |
---|---|
URL | https://{service IP address}:{port}/infer |
Request type | POST |
Request example | curl -H "Accept: application/json" -H "Content-type: application/json" --cacert ca.pem --cert client.pem --key client.key.pem -X POST -d '{ "inputs": "My name is Olivier and I", "stream": true, "parameters": { "temperature": 0.5, "top_k": 10, "top_p": 0.95, "max_new_tokens": 20, "do_sample": true, "seed": null, "repetition_penalty": 1.03, "details": true, "typical_p": 0.5, "watermark": false, "priority": 5, "timeout": 10 } }' https://127.0.0.1:1025/infer |
Response example | data: {"prefill_time":45.54,"decode_time":null,"token":{"id":626,"text":"am"}} data: {"prefill_time":null,"decode_time":128.32,"token":{"id":263,"text":" a"}} data: {"prefill_time":null,"decode_time":18.17,"token":{"id":5176,"text":" French"}} data: {"prefill_time":null,"decode_time":16.80,"token":{"id":17739,"text":" photograph"}} data: {"prefill_time":null,"decode_time":16.80,"token":{"id":261,"text":"er"}} data: {"prefill_time":null,"decode_time":16.80,"token":{"id":2729,"text":" based"}} data: {"prefill_time":null,"decode_time":16.80,"token":{"id":297,"text":" in"}} data: {"prefill_time":null,"decode_time":16.80,"token":{"id":3681,"text":" Paris"}} data: {"prefill_time":null,"decode_time":16.80,"token":{"id":29889,"text":"."}} data: {"prefill_time":null,"decode_time":16.80,"token":{"id":13,"text":"\n"}} data: {"prefill_time":null,"decode_time":16.80,"token":{"id":29902,"text":"I"}} data: {"prefill_time":null,"decode_time":16.80,"token":{"id":505,"text":" have"}} data: {"prefill_time":null,"decode_time":16.80,"token":{"id":1063,"text":" been"}} data: {"prefill_time":null,"decode_time":16.80,"token":{"id":27904,"text":" shooting"}} data: {"prefill_time":null,"decode_time":16.80,"token":{"id":1951,"text":" since"}} data: {"prefill_time":null,"decode_time":16.80,"token":{"id":306,"text":" I"}} data: {"prefill_time":null,"decode_time":16.80,"token":{"id":471,"text":" was"}} data: {"prefill_time":null,"decode_time":16.80,"token":{"id":29871,"text":" "}} data: {"prefill_time":null,"decode_time":16.80,"token":{"id":29896,"text":"1"}} data: {"prefill_time":null,"decode_time":16.80,"generated_text":"am a French photographer based in Paris.\nI have been shooting since I was 15","details":{"finish_reason":"length","generated_tokens":20,"seed":846930886},"token":{"id":29945,"text":null}} |
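The streaming response shown above is a sequence of `data: {...}` events, where the last event additionally carries `generated_text` and `details`. The sketch below shows one way a client might consume such a stream. It assumes each event arrives on its own line, which the flattened example does not confirm, and the helper names `iter_stream_events` and `collect_text` are hypothetical.

```python
import json


def iter_stream_events(lines):
    """Yield one dict per `data: {...}` line of a streaming response.

    Assumption: every event sits on its own line. `lines` may be any
    iterable of str or bytes, for example an open HTTP response object.
    """
    for raw in lines:
        line = raw.decode("utf-8") if isinstance(raw, bytes) else raw
        line = line.strip()
        if not line.startswith("data:"):
            continue  # skip blank separators and anything that is not an event
        yield json.loads(line[len("data:"):])


def collect_text(events):
    """Return the generated text for a sequence of stream events.

    The final event in the example carries the full `generated_text`,
    so that value is preferred; otherwise the token texts are joined.
    """
    pieces = []
    for event in events:
        if event.get("generated_text") is not None:
            return event["generated_text"]
        text = event["token"]["text"]
        if text is not None:
            pieces.append(text)
    return "".join(pieces)
```

With the Python standard library, the object returned by `urllib.request.urlopen` can be iterated line by line, so it could be passed to `iter_stream_events` directly to print each token's `text` as it arrives.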