{"id":72099,"date":"2026-02-04T22:18:14","date_gmt":"2026-02-04T14:18:14","guid":{"rendered":"https:\/\/www.wsisp.com\/helps\/72099.html"},"modified":"2026-02-04T22:18:14","modified_gmt":"2026-02-04T14:18:14","slug":"%e8%87%aa%e6%89%98%e7%ae%a1-llms-%e6%97%b6%ef%bc%8c%e4%bd%a0%e7%9a%84%e6%9c%8d%e5%8a%a1%e5%99%a8%e8%83%bd%e6%89%bf%e5%8f%97%e5%a4%9a%e5%a4%a7%e7%9a%84%e5%8e%8b%e5%8a%9b%ef%bc%9f","status":"publish","type":"post","link":"https:\/\/www.wsisp.com\/helps\/72099.html","title":{"rendered":"\u81ea\u6258\u7ba1 LLMs \u65f6\uff0c\u4f60\u7684\u670d\u52a1\u5668\u80fd\u627f\u53d7\u591a\u5927\u7684\u538b\u529b\uff1f"},"content":{"rendered":"<p>\u539f\u6587&#xff1a;towardsdatascience.com\/load-testing-self-hosted-llms-29ca8a4cf43a<\/p>\n<p>\u5f53\u4e00\u7fa4\u7528\u6237\u7a81\u7136\u5f00\u59cb\u4f7f\u7528\u53ea\u6709\u4f60\u548c\u4f60\u7684\u5f00\u53d1\u56e2\u961f\u4e4b\u524d\u4f7f\u7528\u8fc7\u7684\u5e94\u7528\u7a0b\u5e8f\u65f6&#xff0c;\u611f\u89c9\u5982\u4f55&#xff1f;<\/p>\n<p>\u8fd9\u662f\u4ece\u539f\u578b\u5230\u751f\u4ea7\u7684\u767e\u4e07\u7f8e\u5143\u95ee\u9898\u3002<\/p>\n<p>\u5c31 LLMs \u800c\u8a00&#xff0c;\u4f60\u53ef\u4ee5\u5728\u9884\u7b97\u548c\u53ef\u63a5\u53d7\u7684\u54c1\u8d28\u5185\u8fdb\u884c\u51e0\u5341\u79cd\u8c03\u6574\u6765\u8fd0\u884c\u4f60\u7684\u5e94\u7528\u7a0b\u5e8f\u3002\u4f8b\u5982&#xff0c;\u4f60\u53ef\u4ee5\u9009\u62e9\u91cf\u5316\u6a21\u578b\u4ee5\u964d\u4f4e\u5185\u5b58\u4f7f\u7528\u3002\u6216\u8005\u4f60\u53ef\u4ee5\u5fae\u8c03\u4e00\u4e2a\u5c0f\u578b\u6a21\u578b&#xff0c;\u51fb\u8d25\u5927\u578b LLMs \u7684\u6027\u80fd\u3002<\/p>\n<p>\u6211\u5fae\u8c03\u4e86 Tiny Llama 3.2 1B \u4ee5\u66ff\u4ee3 GPT-4o<\/p>\n<p>\u4f60\u751a\u81f3\u53ef\u4ee5\u8c03\u6574\u4f60\u7684\u57fa\u7840\u8bbe\u65bd\u4ee5\u83b7\u5f97\u66f4\u597d\u7684\u7ed3\u679c\u3002\u4f8b\u5982&#xff0c;\u4f60\u53ef\u80fd\u60f3\u5c06\u4f7f\u7528\u7684 GPU \u6570\u91cf\u52a0\u500d\u6216\u9009\u62e9\u6700\u65b0\u4e00\u4ee3\u7684 GPU\u3002<\/p>\n<p>\u4f46\u4f60\u600e\u4e48\u80fd\u8bf4\u9009\u9879 A \u6bd4\u9009\u9879 B \u548c C \u8868\u73b0\u66f4\u597d\u5462&#xff1f;<\/p>\n<p>\u5728\u8fdb\u5165\u751f\u4ea7\u9636\u6bb5\u7684\u6700\u65e9\u671f&#xff0c;\u8fd9\u662f\u4e00\u4e2a\u91cd\u8981\u7684\u95ee\u9898\u3002\u6240\u6709\u8fd9\u4e9b\u9009\u9879\u90fd\u6709\u5b83\u4eec\u7684\u6210\u672c\u2014\u2014\u57fa\u7840\u8bbe\u65bd\u6210\u672c\u6216\u4e22\u5931\u7684\u6700\u7ec8\u7528\u6237\u4f53\u9a8c\u3002<\/p>\n<p>\u8fd9\u4e2a\u5173\u952e\u95ee\u9898\u7684\u89e3\u51b3\u65b9\u6848\u5e76\u4e0d\u65b0\u9896\u3002\u8d1f\u8f7d\u6d4b\u8bd5\u5728\u6240\u6709\u8f6f\u4ef6\u53d1\u5e03\u4e2d\u90fd\u5df2\u88ab\u5b9e\u8df5\u3002<\/p>\n<p>\u5728\u8fd9\u7bc7\u6587\u7ae0\u4e2d&#xff0c;\u6211\u5c06\u8ba8\u8bba\u5982\u4f55\u4f7f\u7528\u514d\u8d39\u7684 Postman \u5e94\u7528\u7a0b\u5e8f\u5feb\u901f\u8fdb\u884c\u8d1f\u8f7d\u6d4b\u8bd5\u3002\u6211\u4eec\u8fd8\u5c06\u5c1d\u8bd5\u5728\u5355\u4e2a A40 GPU\u30012 \u500d\u4e8e\u6b64\u6216\u5347\u7ea7\u5230 L40S GPU \u4e4b\u95f4\u9009\u62e9\u6700\u4f73\u7684\u57fa\u7840\u8bbe\u65bd\u3002<\/p>\n<h3>\u8ba1\u5212&#xff1a;\u6211\u4eec\u5982\u4f55\u51b3\u5b9a\u57fa\u7840\u8bbe\u65bd<\/h3>\n<p>\u8fd9\u662f\u6211\u4eec\u7684\u76ee\u6807\u3002<\/p>\n<p>\u6211\u4eec\u4e3a\u63a8\u7406\u670d\u52a1\u6258\u7ba1\u4e86Llama 3.1 8B&#xff0c;\u5e76\u4f7f\u7528 Ollama \u6765\u90e8\u7f72\u6211\u4eec\u7684\u6a21\u578b\u3002\u7136\u800c&#xff0c;\u6211\u4eec\u4e0d\u77e5\u9053\u6258\u7ba1\u6b64\u6a21\u578b\u7684\u786c\u4ef6\u662f\u5426\u8db3\u591f\u3002<\/p>\n<p>\u6211\u4eec\u76ee\u524d\u90e8\u7f72\u4e86\u4e00\u4e2aA40 GPU&#xff0c;\u62e5\u6709 48 GB \u7684 VRAM&#xff0c;50 GB \u7684 RAM \u548c\u4e00\u4e2a 9vCPU \u6765\u670d\u52a1\u4e8e\u63a8\u7406\u5f15\u64ce\u3002\u6211\u4eec\u79df\u7528\u8fd9\u4e2a\u57fa\u7840\u8bbe\u65bd\u7684\u8d39\u7528\u4e3a\u6bcf\u6708 280.8 \u7f8e\u5143\u3002<\/p>\n<p>\u5728\u6211\u4eec\u4e0a\u7ebf\u4e4b\u524d&#xff0c;\u6211\u4eec\u9700\u8981\u786e\u4fdd\u8fd9\u8db3\u4ee5\u81f3\u5c11\u670d\u52a1\u4e8e 100 \u4e2a\u7528\u6237\u3002<\/p>\n<p>\u8ba9\u6211\u4eec\u5047\u8bbe\u5176\u4ed6\u9009\u9879\u662f\u62e5\u6709\u76f8\u540c GPU \u7684\u53e6\u4e00\u4e2a\u5b9e\u4f8b&#xff08;\u6210\u672c\u52a0\u500d&#xff09;\u548c\u79df\u7528\u4e00\u4e2aL40S GPU&#xff0c;\u62e5\u6709 48 GB VRAM&#xff0c;62 GB RAM \u548c 16 \u4e2a vCPU\u3002\u540e\u8005\u6bcf\u6708\u8d39\u7528\u4e3a 741.6 \u7f8e\u5143\u3002<\/p>\n<p>\u5982\u679c\u4f60\u5df2\u7ecf\u51b3\u5b9a\u79df\u7528 GPU&#xff0c;\u4f60\u5c06\u62e5\u6709\u6bd4\u8fd9\u4e24\u4e2a\u66f4\u591a\u7684\u9009\u62e9\u3002\u4f46\u73b0\u5728\u8ba9\u6211\u4eec\u53ea\u8003\u8651\u8fd9\u4e24\u4e2a\u3002<\/p>\n<p>\u6211\u4eec\u5c06\u901a\u8fc7\u7ed9\u5b83\u4eec\u76f8\u540c\u7684\u4efb\u52a1\u6765\u6d4b\u8bd5\u8fd9\u4e9b\u9009\u9879\u3002\u6211\u4eec\u8fd8\u5c06\u6a21\u62df 50 \u4e2a\u865a\u62df\u7528\u6237&#xff0c;\u5e76\u5c06\u6570\u91cf\u589e\u52a0\u5230 100&#xff0c;\u4ee5\u67e5\u770b\u5b83\u5982\u4f55\u5f71\u54cd\u54cd\u5e94\u65f6\u95f4\u548c\u9519\u8bef\u7387\u3002<\/p>\n<p>\u8ba9\u6211\u4eec\u5f00\u59cb\u5427\u3002<\/p>\n<h3>\u8bbe\u7f6e Postman \u8fdb\u884c LLM \u8d1f\u8f7d\u6d4b\u8bd5<\/h3>\n<p>\u60a8\u53ef\u4ee5\u4ece\u4ed6\u4eec\u7684\u7f51\u7ad9\u514d\u8d39\u4e0b\u8f7d Postman \u5e94\u7528\u7a0b\u5e8f\u3002\u6211\u4e0d\u4f1a\u8be6\u7ec6\u4ecb\u7ecd\u5b89\u88c5\u8bf4\u660e\u3002<\/p>\n<p>\u5047\u8bbe\u6211\u4eec\u7684 LLM \u516c\u5f00\u4e86\u4e00\u4e2a api\/generate \u7aef\u70b9&#xff0c;\u6211\u4eec\u5c06\u4f7f\u7528\u5b83\u6765\u751f\u6210\u63d0\u793a\u7684\u8f93\u51fa\u3002\u4ee5\u4e0b\u662f\u4e00\u4e2a cURL \u793a\u4f8b\u3002<\/p>\n<p>curl <span class=\"token operator\">&#8211;<\/span><span class=\"token operator\">&#8211;<\/span>location <span class=\"token string\">&#039;https:\/\/&lt;api-host&gt;\/api\/generate&#039;<\/span><br \/>\n<span class=\"token operator\">&#8211;<\/span><span class=\"token operator\">&#8211;<\/span>header <span class=\"token string\">&#039;Content-Type: application\/json&#039;<\/span><br \/>\n<span class=\"token operator\">&#8211;<\/span><span class=\"token operator\">&#8211;<\/span>data &#039;<span class=\"token punctuation\">{<\/span><br \/>\n    <span class=\"token string\">&#034;model&#034;<\/span><span class=\"token punctuation\">:<\/span> <span class=\"token string\">&#034;llama3.1:8b&#034;<\/span><span class=\"token punctuation\">,<\/span><br \/>\n    <span class=\"token string\">&#034;prompt&#034;<\/span><span class=\"token punctuation\">:<\/span> <span class=\"token string\">&#034;Write a 100 word essay about a random public figure&#034;<\/span><span class=\"token punctuation\">,<\/span><br \/>\n    <span class=\"token string\">&#034;stream&#034;<\/span><span class=\"token punctuation\">:<\/span> false<br \/>\n<span class=\"token punctuation\">}<\/span>&#039;<\/p>\n<p>\u4e0a\u8ff0\u793a\u4f8b\u8981\u6c42\u6211\u4eec\u7684\u670d\u52a1\u5668\u4e0a\u7684 Llama3.1:8b \u521b\u5efa\u4e00\u7bc7 100 \u5b57\u7684\u968f\u673a\u6587\u7ae0\u3002\u8ba9\u6211\u4eec\u5728 Postman \u4e0a\u5b8c\u6210\u8fd9\u4e2a\u4efb\u52a1\u3002<\/p>\n<p>\u6253\u5f00 Postman \u5e94\u7528&#xff0c;\u521b\u5efa\u4e00\u4e2a\u65b0\u7684\u96c6\u5408&#xff0c;\u5e76\u7ed9\u5b83\u547d\u540d\u3002<\/p>\n<p>2026-02-04kyx0fithnjz.png<\/p>\n<p>\u5728 Postman \u4e2d\u521b\u5efa\u65b0\u96c6\u5408 \u2013 \u4f5c\u8005\u622a\u56fe\u3002<\/p>\n<p>\u7136\u540e\u70b9\u51fb\u65b0\u521b\u5efa\u7684\u96c6\u5408\u4e0a\u65b9\u7684\u5bfc\u5165\u6309\u94ae&#xff0c;\u5e76\u5c06\u793a\u4f8b cURL \u7c98\u8d34\u4ee5\u4e0e\u60a8\u7684 LLM \u670d\u52a1\u5668\u901a\u4fe1\u3002\u786e\u4fdd\u60a8\u5df2\u9009\u62e9\u4e86\u6b63\u786e\u7684\u96c6\u5408\u3002\u7136\u540e\u70b9\u51fb\u5bfc\u5165\u5230\u96c6\u5408\u4e2d\u3002<\/p>\n<p>2026-02-04ixohyojkyyt.png<\/p>\n<p>\u5c06 cURL \u5bfc\u5165\u96c6\u5408\u4e2d \u2013 \u4f5c\u8005\u622a\u56fe\u3002<\/p>\n<p>\u5982\u679c\u60a8\u7684 API \u7aef\u70b9\u662f\u53d7\u4fdd\u62a4\u7684\u5e76\u4e14\u9700\u8981\u4ee4\u724c&#xff0c;\u60a8\u53ef\u4ee5\u5728\u65b0\u51fa\u73b0\u7684\u7a97\u53e3\u4e2d\u8fdb\u884c\u914d\u7f6e\u3002\u5982\u679c\u9700\u8981&#xff0c;\u6211\u4eec\u8fd8\u53ef\u4ee5\u7f16\u8f91\u8bf7\u6c42\u6b63\u6587\u3002<\/p>\n<p>\u8ba9\u6211\u4eec\u53d1\u9001\u4e00\u4e2a\u8bf7\u6c42&#xff0c;\u770b\u770b\u5b83\u662f\u5426\u6309\u9884\u671f\u5de5\u4f5c&#xff1a;<\/p>\n<p>2026-02-04ibjpymt1qf0.png<\/p>\n<p>\u4ece Postman \u8c03\u7528 Llama 3.1 8b \u2013 \u4f5c\u8005\u622a\u56fe\u3002<\/p>\n<p>\u5de5\u4f5c\u6b63\u5e38\u3002\u6211\u4eec\u51c6\u5907\u597d\u5f00\u59cb\u5bf9\u8fd9\u4e2a\u670d\u52a1\u5668\u8fdb\u884c\u7b2c\u4e00\u6b21\u8d1f\u8f7d\u6d4b\u8bd5\u3002<\/p>\n<p>2026-02-04zm43zxbeovf.png<\/p>\n<p>\u5728 Postman \u4e2d\u521b\u5efa\u8d1f\u8f7d\u6d4b\u8bd5\u8fd0\u884c\u5668 \u2013 \u4f5c\u8005\u622a\u56fe\u3002<\/p>\n<p>\u9996\u5148&#xff0c;\u70b9\u51fb\u96c6\u5408\u540d\u79f0\u65c1\u8fb9\u7684\u4e09\u4e2a\u70b9&#xff0c;\u7136\u540e\u70b9\u51fb\u201c\u8fd0\u884c\u96c6\u5408\u201d&#xff0c;\u5982\u56fe\u6240\u793a\u3002\u73b0\u5728&#xff0c;\u6211\u4eec\u6709\u51e0\u4e2a\u8d1f\u8f7d\u914d\u7f6e\u6587\u4ef6\u53ef\u4f9b\u9009\u62e9\u3002\u6211\u7ecf\u5e38\u4f7f\u7528\u56fa\u5b9a\u8d1f\u8f7d\u914d\u7f6e\u6587\u4ef6\u548c Ramp-Up \u914d\u7f6e\u6587\u4ef6\u3002<\/p>\n<p>\u56fa\u5b9a\u8d1f\u8f7d\u914d\u7f6e\u6587\u4ef6&#xff1a;\u6b63\u5982\u5176\u540d\u6240\u793a&#xff0c;\u8fd9\u79cd\u6d4b\u8bd5\u6a21\u62df\u4e86\u914d\u7f6e\u7684\u865a\u62df\u7528\u6237\u6570\u91cf&#xff0c;\u5e76\u5f00\u59cb\u53d1\u9001\u8bf7\u6c42\u3002\u5728\u6574\u4e2a\u6d4b\u8bd5\u8fc7\u7a0b\u4e2d\u4fdd\u6301\u76f8\u540c\u7684\u7528\u6237\u6c34\u5e73\u3002<\/p>\n<p>Ramp-Up \u914d\u7f6e\u6587\u4ef6&#xff1a;\u8fd9\u79cd\u6280\u672f\u5728\u6574\u4e2a\u6d4b\u8bd5\u8fc7\u7a0b\u4e2d\u9010\u6e10\u589e\u52a0\u865a\u62df\u7528\u6237\u6570\u91cf&#xff0c;\u5e76\u6536\u96c6\u8bf7\u6c42\u5904\u7406\u65f6\u95f4\u548c\u9519\u8bef\u7387\u3002<\/p>\n<p>\u5bf9\u4e8e\u6211\u7684\u6d4b\u8bd5&#xff0c;\u6211\u5c06\u4f7f\u7528\u5e26\u6709 50 \u4e2a\u521d\u59cb\u7528\u6237\u7684 Ramp Up \u914d\u7f6e\u6587\u4ef6&#xff0c;\u5e76\u5c06\u5176\u589e\u52a0\u5230 100\u3002\u5c31\u8fd9\u6837&#xff1b;\u73b0\u5728\u70b9\u51fb\u8fd0\u884c\u4ee5\u8fdb\u884c\u8d1f\u8f7d\u6d4b\u8bd5\u3002<\/p>\n<h3>\u8d1f\u8f7d\u6d4b\u8bd5\u8fd0\u884c\u548c\u7ed3\u679c \u2013 \u5355\u4e2a A10 GPU\u3002<\/h3>\n<p>Postman \u73b0\u5728\u5c06\u5f00\u59cb\u4f7f\u7528\u865a\u62df\u7528\u6237\u53d1\u9001\u8bf7\u6c42\u3002\u60a8\u53ef\u4ee5\u76d1\u63a7\u8fd9\u4e2a\u8fc7\u7a0b\u3002\u4ee5\u4e0b\u662f\u7ed3\u679c\u53ef\u80fd\u770b\u8d77\u6765\u50cf\u4ec0\u4e48\u3002<\/p>\n<p>2026-02-04sohbwb4mr20.png<\/p>\n<p>Llama3.1:8b \u5728 A10 GPU \u4e0a\u7684\u8d1f\u8f7d\u6d4b\u8bd5\u7ed3\u679c \u2013 \u4f5c\u8005\u622a\u56fe\u3002<\/p>\n<p>\u8fd9\u662f\u6211\u4eec\u57fa\u7ebf\u57fa\u7840\u8bbe\u65bd&#xff1a;\u4e00\u4e2a A10 GPU\u3002\u622a\u56fe\u8868\u660e&#xff0c;Postman \u5728 3 \u5206\u949f\u5185\u53d1\u9001\u4e86 318 \u4e2a\u8bf7\u6c42\u3002\u4e5f\u5c31\u662f\u8bf4&#xff0c;\u6211\u4eec\u7684\u670d\u52a1\u5668\u6bcf\u79d2\u54cd\u5e94\u5927\u7ea6 1.71 \u4e2a\u8bf7\u6c42\u3002\u7136\u800c&#xff0c;\u8bf7\u6c42\u7684\u5e73\u5747\u5904\u7406\u65f6\u95f4\u4e3a 34 \u79d2\u3002\u8fd9\u610f\u5473\u7740\u7528\u6237\u5c06\u4e0d\u5f97\u4e0d\u7b49\u5f85\u534a\u5206\u949f\u624d\u80fd\u4ece\u6211\u4eec\u7684\u670d\u52a1\u5668\u83b7\u5f97\u54cd\u5e94\u3002\u6b64\u5916&#xff0c;2.2%\u7684\u8bf7\u6c42\u6ca1\u6709\u83b7\u5f97\u54cd\u5e94\u3002<\/p>\n<p>\u56fe\u8868\u8fd8\u663e\u793a&#xff0c;\u54cd\u5e94\u65f6\u95f4\u6700\u521d\u8f83\u77ed&#xff0c;\u4f46\u968f\u7740\u670d\u52a1\u5668\u63a5\u6536\u66f4\u591a\u8bf7\u6c42\u800c\u6076\u5316\u3002<\/p>\n<p>\u6839\u636e\u6211\u4eec\u7684\u60c5\u51b5&#xff0c;\u6211\u4eec\u53ef\u80fd\u53ef\u4ee5\u63a5\u53d7\u8fd9\u79cd\u6027\u80fd\u6216\u5c1d\u8bd5\u6539\u8fdb\u5b83\u3002\u4f46\u8ba9\u6211\u4eec\u4e5f\u5c1d\u8bd5\u5176\u4ed6\u57fa\u7840\u8bbe\u65bd&#xff0c;\u770b\u770b\u6211\u4eec\u53ef\u4ee5\u5b9e\u73b0\u591a\u5c11\u6539\u8fdb\u3002<\/p>\n<h3>\u4f7f\u7528\u66f4\u591a GPU \u8fdb\u884c\u8d1f\u8f7d\u6d4b\u8bd5 \u2013 2X A10 GPU<\/h3>\n<p>\u73b0\u5728\u6211\u4eec\u5207\u6362\u5230\u53e6\u4e00\u53f0\u670d\u52a1\u5668&#xff0c;\u8be5\u670d\u52a1\u5668\u914d\u5907\u4e86\u4e24\u4e2a\u76f8\u540c\u7684 A10 GPU\u3002\u6211\u4eec\u7ee7\u7eed\u4f7f\u7528\u76f8\u540c\u7684\u8d1f\u8f7d\u6d4b\u8bd5\u4efb\u52a1\u3002<\/p>\n<p>\u4e0b\u9762\u662f\u7ed3\u679c\u7684\u6837\u5b50\u3002<\/p>\n<p>2026-02-04krkydvvzquo.png<\/p>\n<p>Llama3.1:8b \u5728 2X A10 GPU \u4e0a\u7684\u8d1f\u8f7d\u6d4b\u8bd5\u7ed3\u679c \u2013 \u4f5c\u8005\u622a\u56fe<\/p>\n<p>\u7ed3\u679c\u663e\u793a&#xff0c;\u6211\u4eec\u7684\u57fa\u7ebf\u57fa\u7840\u8bbe\u65bd\u6709\u6240\u6539\u5584\u3002\u54cd\u5e94\u65f6\u95f4\u4ece 34 \u79d2\u964d\u81f3 31 \u79d2\u2014\u2014\u5fae\u4e0d\u8db3\u9053\u3002\u7136\u800c&#xff0c;\u9519\u8bef\u7387\u98d9\u5347\u81f3 5.19%\u3002<\/p>\n<p>\u4ece\u5916\u89c2\u4e0a\u770b&#xff0c;\u82b1\u8d39\u53cc\u500d\u7684\u6210\u672c\u5e76\u4e0d\u503c\u5f97\u8fd9\u79cd\u6539\u8fdb\u3002<\/p>\n<h3>\u4f7f\u7528\u66f4\u597d\u7684 GPU \u8fdb\u884c\u8d1f\u8f7d\u6d4b\u8bd5 \u2013 L40S<\/h3>\n<p>L40S \u662f NVIDIA \u6700\u65b0\u4e00\u4ee3 GPU \u4e4b\u4e00\u3002\u5c3d\u7ba1\u5982\u6b64&#xff0c;\u5b83\u7684 VRAM \u4e0e A10 \u5927\u81f4\u76f8\u540c\u3002\u8ba9\u6211\u4eec\u770b\u770b\u5b83\u5728\u7c7b\u4f3c\u60c5\u51b5\u4e0b\u7684\u8868\u73b0\u3002<\/p>\n<p>2026-02-04oqnplltyew4.png<\/p>\n<p>Llama3.1:8b \u5728 L40S GPU \u4e0a\u7684\u8d1f\u8f7d\u6d4b\u8bd5\u7ed3\u679c \u2013 \u4f5c\u8005\u622a\u56fe<\/p>\n<p>\u7ed3\u679c\u662f\u6df1\u523b\u7684\u3002<\/p>\n<p>\u5e73\u5747\u5904\u7406\u65f6\u95f4\u5df2\u964d\u81f3 26 \u79d2\u2014\u2014\u4ecd\u7136\u592a\u957f\u3002\u9519\u8bef\u7387\u4e5f\u964d\u81f3 1.11%\u3002<\/p>\n<p>\u6700\u5927\u7684\u7f3a\u70b9\u662f L40S \u7684\u6210\u672c\u6bd4 2X A10 \u9ad8\u5f97\u591a\u3002\u8fd9\u53ef\u80fd\u5bf9\u4e8e\u5173\u952e\u4efb\u52a1\u63a8\u7406\u9700\u6c42\u662f\u5fc5\u8981\u7684\u3002\u7136\u800c&#xff0c;\u6839\u636e\u901a\u7528\u7528\u9014\u7684\u7ed3\u679c&#xff0c;\u6211\u66f4\u503e\u5411\u4e8e\u575a\u6301\u4f7f\u7528\u5355\u4e2a A10\u3002<\/p>\n<p>\u6700\u6709\u4ef7\u503c\u7684 LLM \u5f00\u53d1\u6280\u80fd\u6613\u4e8e\u5b66\u4e60&#xff0c;\u4f46\u5b9e\u8df5\u6210\u672c\u9ad8\u6602<\/p>\n<h3>\u6700\u540e\u7684\u60f3\u6cd5<\/h3>\n<p>\u8d1f\u8f7d\u6d4b\u8bd5\u6709\u52a9\u4e8e\u6211\u4eec\u4e86\u89e3\u670d\u52a1\u5668\u5728\u4e0d\u540c\u6d41\u91cf\u6c34\u5e73\u4e0b\u7684\u884c\u4e3a\u3002\u8fd9\u5bf9\u4e8e\u5f00\u53d1 LLM \u5e94\u7528\u7a0b\u5e8f\u81f3\u5173\u91cd\u8981\u3002\u6211\u4eec\u5e94\u8be5\u7cbe\u786e\u77e5\u9053\u53d1\u9001\u5230\u670d\u52a1\u5668\u7684\u6bcf\u4e2a\u8bf7\u6c42\u7684\u5904\u7406\u65f6\u95f4\u3002<\/p>\n<p>\u8fdb\u884c\u8d1f\u8f7d\u6d4b\u8bd5\u6700\u7b80\u5355\u7684\u65b9\u6cd5\u662f\u4f7f\u7528 Postman\u3002\u5b83\u662f\u514d\u8d39\u7684&#xff0c;\u64cd\u4f5c\u7b80\u5355\u3002\u5f53\u7136&#xff0c;\u5b83\u4e5f\u6709\u4e0d\u8db3\u4e4b\u5904\u3002<\/p>\n<p>\u5728\u8fd9\u7bc7\u6587\u7ae0\u4e2d&#xff0c;\u6211\u4f7f\u7528\u4e86 Postman \u8fdb\u884c\u8d1f\u8f7d\u6d4b\u8bd5&#xff0c;\u5e76\u5728\u4e09\u4e2a\u57fa\u7840\u8bbe\u65bd\u9009\u9879\u4e4b\u95f4\u505a\u51fa\u9009\u62e9\u3002\u6211\u5f97\u51fa\u7ed3\u8bba&#xff0c;\u5bf9\u4e8e\u6211\u7684\u4efb\u52a1\u6765\u8bf4&#xff0c;\u5355\u4e2a A10 GPU \u6bd4\u66f4\u591a GPU \u6216\u5347\u7ea7\u5230\u6700\u65b0\u4e00\u4ee3 GPU \u66f4\u597d\u3002<\/p>\n<p>\u5728\u5b9e\u9645\u5e94\u7528\u4e2d&#xff0c;\u6211\u4eec\u5fc5\u987b\u5728\u51e0\u4e2a\u9009\u62e9\u4e4b\u95f4\u505a\u51fa\u51b3\u5b9a\u2014\u2014\u57fa\u7840\u8bbe\u65bd\u548c\u8bbe\u8ba1\u3002\u5bf9\u5b83\u4eec\u8fdb\u884c\u8d1f\u8f7d\u6d4b\u8bd5\u5c06\u7ed9\u6211\u4eec\u4e00\u4e2a\u51c6\u786e\u7684\u60c5\u51b5&#xff0c;\u5373\u670d\u52a1\u5668\u5728\u7c7b\u4f3c\u73b0\u5b9e\u751f\u6d3b\u4e2d\u7684\u60c5\u51b5\u4e0b\u4f1a\u5982\u4f55\u8868\u73b0\u3002<\/p>\n<hr \/>\n<p>\u611f\u8c22\u9605\u8bfb&#xff0c;\u670b\u53cb&#xff01;\u9664\u4e86 Medium&#xff0c; \u6211\u8fd8\u5728 LinkedIn \u548c X, \u4e0a&#xff01;*<\/p>\n","protected":false},"excerpt":{"rendered":"<p>\u539f\u6587&#xff1a;towardsdatascience.com\/load-testing-self-hosted-llms-29ca8a4cf43a \u5f53\u4e00\u7fa4\u7528\u6237\u7a81\u7136\u5f00\u59cb\u4f7f\u7528\u53ea\u6709\u4f60\u548c\u4f60\u7684\u5f00\u53d1\u56e2\u961f\u4e4b\u524d\u4f7f\u7528\u8fc7\u7684\u5e94\u7528\u7a0b\u5e8f\u65f6&#xff0c;\u611f\u89c9\u5982\u4f55&#xff1f;<br \/>\n\u8fd9\u662f\u4ece\u539f\u578b\u5230\u751f\u4ea7\u7684\u767e\u4e07\u7f8e\u5143\u95ee\u9898\u3002<br \/>\n\u5c31 LLMs \u800c\u8a00&#xff0c;\u4f60\u53ef\u4ee5\u5728\u9884\u7b97\u548c\u53ef\u63a5\u53d7\u7684\u54c1\u8d28\u5185\u8fdb\u884c\u51e0\u5341\u79cd\u8c03\u6574\u6765\u8fd0\u884c\u4f60\u7684\u5e94\u7528\u7a0b\u5e8f\u3002\u4f8b\u5982&#xff0c;\u4f60\u53ef\u4ee5\u9009\u62e9\u91cf\u5316\u6a21\u578b\u4ee5\u964d\u4f4e\u5185\u5b58\u4f7f\u7528\u3002\u6216\u8005\u4f60\u53ef\u4ee5\u5fae\u8c03\u4e00\u4e2a\u5c0f\u578b\u6a21\u578b&amp;#xf<\/p>\n","protected":false},"author":2,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[1],"tags":[7587],"topic":[],"class_list":["post-72099","post","type-post","status-publish","format-standard","hentry","category-server","tag-7587"],"yoast_head":"<!-- This site is optimized with the Yoast SEO plugin v20.3 - https:\/\/yoast.com\/wordpress\/plugins\/seo\/ -->\n<title>\u81ea\u6258\u7ba1 LLMs \u65f6\uff0c\u4f60\u7684\u670d\u52a1\u5668\u80fd\u627f\u53d7\u591a\u5927\u7684\u538b\u529b\uff1f - \u7f51\u7855\u4e92\u8054\u5e2e\u52a9\u4e2d\u5fc3<\/title>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/www.wsisp.com\/helps\/72099.html\" \/>\n<meta property=\"og:locale\" content=\"zh_CN\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"\u81ea\u6258\u7ba1 LLMs \u65f6\uff0c\u4f60\u7684\u670d\u52a1\u5668\u80fd\u627f\u53d7\u591a\u5927\u7684\u538b\u529b\uff1f - \u7f51\u7855\u4e92\u8054\u5e2e\u52a9\u4e2d\u5fc3\" \/>\n<meta property=\"og:description\" content=\"\u539f\u6587&#xff1a;towardsdatascience.com\/load-testing-self-hosted-llms-29ca8a4cf43a \u5f53\u4e00\u7fa4\u7528\u6237\u7a81\u7136\u5f00\u59cb\u4f7f\u7528\u53ea\u6709\u4f60\u548c\u4f60\u7684\u5f00\u53d1\u56e2\u961f\u4e4b\u524d\u4f7f\u7528\u8fc7\u7684\u5e94\u7528\u7a0b\u5e8f\u65f6&#xff0c;\u611f\u89c9\u5982\u4f55&#xff1f; \u8fd9\u662f\u4ece\u539f\u578b\u5230\u751f\u4ea7\u7684\u767e\u4e07\u7f8e\u5143\u95ee\u9898\u3002 \u5c31 LLMs \u800c\u8a00&#xff0c;\u4f60\u53ef\u4ee5\u5728\u9884\u7b97\u548c\u53ef\u63a5\u53d7\u7684\u54c1\u8d28\u5185\u8fdb\u884c\u51e0\u5341\u79cd\u8c03\u6574\u6765\u8fd0\u884c\u4f60\u7684\u5e94\u7528\u7a0b\u5e8f\u3002\u4f8b\u5982&#xff0c;\u4f60\u53ef\u4ee5\u9009\u62e9\u91cf\u5316\u6a21\u578b\u4ee5\u964d\u4f4e\u5185\u5b58\u4f7f\u7528\u3002\u6216\u8005\u4f60\u53ef\u4ee5\u5fae\u8c03\u4e00\u4e2a\u5c0f\u578b\u6a21\u578b&amp;#xf\" \/>\n<meta property=\"og:url\" content=\"https:\/\/www.wsisp.com\/helps\/72099.html\" \/>\n<meta property=\"og:site_name\" content=\"\u7f51\u7855\u4e92\u8054\u5e2e\u52a9\u4e2d\u5fc3\" \/>\n<meta property=\"article:published_time\" content=\"2026-02-04T14:18:14+00:00\" \/>\n<meta name=\"author\" content=\"admin\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:label1\" content=\"\u4f5c\u8005\" \/>\n\t<meta name=\"twitter:data1\" content=\"admin\" \/>\n\t<meta name=\"twitter:label2\" content=\"\u9884\u8ba1\u9605\u8bfb\u65f6\u95f4\" \/>\n\t<meta name=\"twitter:data2\" content=\"2 \u5206\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\/\/schema.org\",\"@graph\":[{\"@type\":\"WebPage\",\"@id\":\"https:\/\/www.wsisp.com\/helps\/72099.html\",\"url\":\"https:\/\/www.wsisp.com\/helps\/72099.html\",\"name\":\"\u81ea\u6258\u7ba1 LLMs \u65f6\uff0c\u4f60\u7684\u670d\u52a1\u5668\u80fd\u627f\u53d7\u591a\u5927\u7684\u538b\u529b\uff1f - \u7f51\u7855\u4e92\u8054\u5e2e\u52a9\u4e2d\u5fc3\",\"isPartOf\":{\"@id\":\"https:\/\/www.wsisp.com\/helps\/#website\"},\"datePublished\":\"2026-02-04T14:18:14+00:00\",\"dateModified\":\"2026-02-04T14:18:14+00:00\",\"author\":{\"@id\":\"https:\/\/www.wsisp.com\/helps\/#\/schema\/person\/358e386c577a3ab51c4493330a20ad41\"},\"breadcrumb\":{\"@id\":\"https:\/\/www.wsisp.com\/helps\/72099.html#breadcrumb\"},\"inLanguage\":\"zh-Hans\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\/\/www.wsisp.com\/helps\/72099.html\"]}]},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\/\/www.wsisp.com\/helps\/72099.html#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"\u9996\u9875\",\"item\":\"https:\/\/www.wsisp.com\/helps\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"\u81ea\u6258\u7ba1 LLMs \u65f6\uff0c\u4f60\u7684\u670d\u52a1\u5668\u80fd\u627f\u53d7\u591a\u5927\u7684\u538b\u529b\uff1f\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\/\/www.wsisp.com\/helps\/#website\",\"url\":\"https:\/\/www.wsisp.com\/helps\/\",\"name\":\"\u7f51\u7855\u4e92\u8054\u5e2e\u52a9\u4e2d\u5fc3\",\"description\":\"\u9999\u6e2f\u670d\u52a1\u5668_\u9999\u6e2f\u4e91\u670d\u52a1\u5668\u8d44\u8baf_\u670d\u52a1\u5668\u5e2e\u52a9\u6587\u6863_\u670d\u52a1\u5668\u6559\u7a0b\",\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\/\/www.wsisp.com\/helps\/?s={search_term_string}\"},\"query-input\":\"required name=search_term_string\"}],\"inLanguage\":\"zh-Hans\"},{\"@type\":\"Person\",\"@id\":\"https:\/\/www.wsisp.com\/helps\/#\/schema\/person\/358e386c577a3ab51c4493330a20ad41\",\"name\":\"admin\",\"image\":{\"@type\":\"ImageObject\",\"inLanguage\":\"zh-Hans\",\"@id\":\"https:\/\/www.wsisp.com\/helps\/#\/schema\/person\/image\/\",\"url\":\"https:\/\/gravatar.wp-china-yes.net\/avatar\/?s=96&d=mystery\",\"contentUrl\":\"https:\/\/gravatar.wp-china-yes.net\/avatar\/?s=96&d=mystery\",\"caption\":\"admin\"},\"sameAs\":[\"http:\/\/wp.wsisp.com\"],\"url\":\"https:\/\/www.wsisp.com\/helps\/author\/admin\"}]}<\/script>\n<!-- \/ Yoast SEO plugin. -->","yoast_head_json":{"title":"\u81ea\u6258\u7ba1 LLMs \u65f6\uff0c\u4f60\u7684\u670d\u52a1\u5668\u80fd\u627f\u53d7\u591a\u5927\u7684\u538b\u529b\uff1f - \u7f51\u7855\u4e92\u8054\u5e2e\u52a9\u4e2d\u5fc3","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/www.wsisp.com\/helps\/72099.html","og_locale":"zh_CN","og_type":"article","og_title":"\u81ea\u6258\u7ba1 LLMs \u65f6\uff0c\u4f60\u7684\u670d\u52a1\u5668\u80fd\u627f\u53d7\u591a\u5927\u7684\u538b\u529b\uff1f - \u7f51\u7855\u4e92\u8054\u5e2e\u52a9\u4e2d\u5fc3","og_description":"\u539f\u6587&#xff1a;towardsdatascience.com\/load-testing-self-hosted-llms-29ca8a4cf43a \u5f53\u4e00\u7fa4\u7528\u6237\u7a81\u7136\u5f00\u59cb\u4f7f\u7528\u53ea\u6709\u4f60\u548c\u4f60\u7684\u5f00\u53d1\u56e2\u961f\u4e4b\u524d\u4f7f\u7528\u8fc7\u7684\u5e94\u7528\u7a0b\u5e8f\u65f6&#xff0c;\u611f\u89c9\u5982\u4f55&#xff1f; \u8fd9\u662f\u4ece\u539f\u578b\u5230\u751f\u4ea7\u7684\u767e\u4e07\u7f8e\u5143\u95ee\u9898\u3002 \u5c31 LLMs \u800c\u8a00&#xff0c;\u4f60\u53ef\u4ee5\u5728\u9884\u7b97\u548c\u53ef\u63a5\u53d7\u7684\u54c1\u8d28\u5185\u8fdb\u884c\u51e0\u5341\u79cd\u8c03\u6574\u6765\u8fd0\u884c\u4f60\u7684\u5e94\u7528\u7a0b\u5e8f\u3002\u4f8b\u5982&#xff0c;\u4f60\u53ef\u4ee5\u9009\u62e9\u91cf\u5316\u6a21\u578b\u4ee5\u964d\u4f4e\u5185\u5b58\u4f7f\u7528\u3002\u6216\u8005\u4f60\u53ef\u4ee5\u5fae\u8c03\u4e00\u4e2a\u5c0f\u578b\u6a21\u578b&amp;#xf","og_url":"https:\/\/www.wsisp.com\/helps\/72099.html","og_site_name":"\u7f51\u7855\u4e92\u8054\u5e2e\u52a9\u4e2d\u5fc3","article_published_time":"2026-02-04T14:18:14+00:00","author":"admin","twitter_card":"summary_large_image","twitter_misc":{"\u4f5c\u8005":"admin","\u9884\u8ba1\u9605\u8bfb\u65f6\u95f4":"2 \u5206"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"WebPage","@id":"https:\/\/www.wsisp.com\/helps\/72099.html","url":"https:\/\/www.wsisp.com\/helps\/72099.html","name":"\u81ea\u6258\u7ba1 LLMs \u65f6\uff0c\u4f60\u7684\u670d\u52a1\u5668\u80fd\u627f\u53d7\u591a\u5927\u7684\u538b\u529b\uff1f - \u7f51\u7855\u4e92\u8054\u5e2e\u52a9\u4e2d\u5fc3","isPartOf":{"@id":"https:\/\/www.wsisp.com\/helps\/#website"},"datePublished":"2026-02-04T14:18:14+00:00","dateModified":"2026-02-04T14:18:14+00:00","author":{"@id":"https:\/\/www.wsisp.com\/helps\/#\/schema\/person\/358e386c577a3ab51c4493330a20ad41"},"breadcrumb":{"@id":"https:\/\/www.wsisp.com\/helps\/72099.html#breadcrumb"},"inLanguage":"zh-Hans","potentialAction":[{"@type":"ReadAction","target":["https:\/\/www.wsisp.com\/helps\/72099.html"]}]},{"@type":"BreadcrumbList","@id":"https:\/\/www.wsisp.com\/helps\/72099.html#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"\u9996\u9875","item":"https:\/\/www.wsisp.com\/helps"},{"@type":"ListItem","position":2,"name":"\u81ea\u6258\u7ba1 LLMs \u65f6\uff0c\u4f60\u7684\u670d\u52a1\u5668\u80fd\u627f\u53d7\u591a\u5927\u7684\u538b\u529b\uff1f"}]},{"@type":"WebSite","@id":"https:\/\/www.wsisp.com\/helps\/#website","url":"https:\/\/www.wsisp.com\/helps\/","name":"\u7f51\u7855\u4e92\u8054\u5e2e\u52a9\u4e2d\u5fc3","description":"\u9999\u6e2f\u670d\u52a1\u5668_\u9999\u6e2f\u4e91\u670d\u52a1\u5668\u8d44\u8baf_\u670d\u52a1\u5668\u5e2e\u52a9\u6587\u6863_\u670d\u52a1\u5668\u6559\u7a0b","potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/www.wsisp.com\/helps\/?s={search_term_string}"},"query-input":"required name=search_term_string"}],"inLanguage":"zh-Hans"},{"@type":"Person","@id":"https:\/\/www.wsisp.com\/helps\/#\/schema\/person\/358e386c577a3ab51c4493330a20ad41","name":"admin","image":{"@type":"ImageObject","inLanguage":"zh-Hans","@id":"https:\/\/www.wsisp.com\/helps\/#\/schema\/person\/image\/","url":"https:\/\/gravatar.wp-china-yes.net\/avatar\/?s=96&d=mystery","contentUrl":"https:\/\/gravatar.wp-china-yes.net\/avatar\/?s=96&d=mystery","caption":"admin"},"sameAs":["http:\/\/wp.wsisp.com"],"url":"https:\/\/www.wsisp.com\/helps\/author\/admin"}]}},"_links":{"self":[{"href":"https:\/\/www.wsisp.com\/helps\/wp-json\/wp\/v2\/posts\/72099","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.wsisp.com\/helps\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.wsisp.com\/helps\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.wsisp.com\/helps\/wp-json\/wp\/v2\/users\/2"}],"replies":[{"embeddable":true,"href":"https:\/\/www.wsisp.com\/helps\/wp-json\/wp\/v2\/comments?post=72099"}],"version-history":[{"count":0,"href":"https:\/\/www.wsisp.com\/helps\/wp-json\/wp\/v2\/posts\/72099\/revisions"}],"wp:attachment":[{"href":"https:\/\/www.wsisp.com\/helps\/wp-json\/wp\/v2\/media?parent=72099"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.wsisp.com\/helps\/wp-json\/wp\/v2\/categories?post=72099"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.wsisp.com\/helps\/wp-json\/wp\/v2\/tags?post=72099"},{"taxonomy":"topic","embeddable":true,"href":"https:\/\/www.wsisp.com\/helps\/wp-json\/wp\/v2\/topic?post=72099"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}