{"id":77658,"date":"2026-02-25T04:19:07","date_gmt":"2026-02-24T20:19:07","guid":{"rendered":"https:\/\/www.wsisp.com\/helps\/77658.html"},"modified":"2026-02-25T04:19:07","modified_gmt":"2026-02-24T20:19:07","slug":"%e8%80%81%e6%9d%bf%e8%a6%81%e7%9a%84rag%e7%b3%bb%e7%bb%9f%e6%80%bb%e4%b8%a2%e8%af%ad%e4%b9%89%ef%bc%8c%e9%9d%a0langchain%e5%9b%9b%e5%b1%82%e9%98%b2%e5%be%a1%ef%bc%8c%e5%86%8d%e4%b9%9f%e4%b8%8d","status":"publish","type":"post","link":"https:\/\/www.wsisp.com\/helps\/77658.html","title":{"rendered":"\u8001\u677f\u8981\u7684RAG\u7cfb\u7edf\u603b\u4e22\u8bed\u4e49\uff0c\u9760LangChain\u56db\u5c42\u9632\u5fa1\uff0c\u518d\u4e5f\u4e0d\u7528\u80cc\u9505\uff01"},"content":{"rendered":"<h2>LangChain\u8bed\u4e49\u4fdd\u7559\u79d8\u7c4d&#xff1a;\u4ece\u539f\u7406\u5230\u5b9e\u6218\u7684\u7ec8\u6781\u6307\u5357<\/h2>\n<p><img decoding=\"async\" src=\"https:\/\/www.wsisp.com\/helps\/wp-content\/uploads\/2026\/02\/20260224201853-699e07ade676f.gif\" alt=\"\u8bf7\u6dfb\u52a0\u56fe\u7247\u63cf\u8ff0\" \/><\/p>\n<\/p>\n<h4>\u6587\u7ae0\u76ee\u5f55<\/h4>\n<ul>\n<li>LangChain\u8bed\u4e49\u4fdd\u7559\u79d8\u7c4d&#xff1a;\u4ece\u539f\u7406\u5230\u5b9e\u6218\u7684\u7ec8\u6781\u6307\u5357<\/li>\n<li>\n<ul>\n<li>\u4e00\u3001\u8bed\u4e49\u4e22\u5931&#xff1a;RAG\u7cfb\u7edf\u7684\u201c\u9690\u5f62\u6740\u624b\u201d<\/li>\n<li>\n<ul>\n<li>1.1 \u4ec0\u4e48\u662f\u8bed\u4e49\u4e22\u5931&#xff1f;\u2014\u2014\u4e0d\u4ec5\u4ec5\u662f\u201c\u4fe1\u606f\u88ab\u5207\u65ad\u201d<\/li>\n<li>1.2 \u8bed\u4e49\u4e22\u5931\u7684\u6839\u6e90\u63a2\u7a76&#xff08;\u5168\u94fe\u8def\u62c6\u89e3&#xff09;<\/li>\n<li>\u5173\u952e\u8ba4\u77e5\u66f4\u65b0&#xff1a;<\/li>\n<\/ul>\n<\/li>\n<li>\u4e8c\u3001LangChain TextSplitter\u6df1\u5ea6\u89e3\u6790\u4e0e\u6700\u4f73\u5b9e\u8df5<\/li>\n<li>\n<ul>\n<li>2.1 \u4e2d\u6587\u4f18\u5316\u7684RecursiveCharacterTextSplitter<\/li>\n<li>\u6838\u5fc3\u914d\u7f6e\u8bf4\u660e&#xff1a;<\/li>\n<li>\u5b9e\u6218Python\u4ee3\u7801\u793a\u4f8b<\/li>\n<\/ul>\n<\/li>\n<li>\u4e09\u3001\u4e94\u5927\u5de5\u7a0b\u7b56\u7565&#xff1a;\u7aef\u5230\u7aef\u8bed\u4e49\u4fdd\u7559\u89e3\u51b3\u65b9\u6848<\/li>\n<li>\n<ul>\n<li>3.1 <font color=\"red\">**\u57fa\u7840\u7b56\u7565&#xff1a;\u4f18\u5148\u7ea7\u5206\u9694\u7b26 &#043; \u9012\u5f52\u5207\u5206&#xff08;\u5fc5\u9009&#xff09;**<\/font><\/li>\n<li>3.2 <font color=\"orange\">**\u8fdb\u9636\u7b56\u7565&#xff1a;\u91cd\u53e0\u7a97\u53e3&#xff08;\u8bed\u4e49\u8865\u507f\u6838\u5fc3&#xff09;**<\/font><\/li>\n<li>3.3 <font color=\"orange\">**\u8fdb\u9636\u7b56\u7565&#xff1a;\u81ea\u5b9a\u4e49\u8bed\u4e49Splitter\u2014\u2014\u4e2d\u6587\u573a\u666f\u7684\u7cbe\u51c6\u89e3\u51b3\u65b9\u6848**<\/font><\/li>\n<li>3.4 <font color=\"purple\">**\u9ad8\u7ea7\u7b56\u7565&#xff1a;\u5143\u6570\u636e\u624b\u52a8\u8865\u5145\u2014\u2014\u68c0\u7d22\u7cbe\u5ea6\u589e\u5f3a\u65b9\u6848**<\/font><\/li>\n<li>3.5 <font color=\"purple\">**\u9ad8\u7ea7\u7b56\u7565&#xff1a;\u7236\u6587\u6863\u68c0\u7d22\u2014\u2014\u957f\u6587\u6863\u8bed\u4e49\u4fdd\u7559\u7ec8\u6781\u65b9\u6848**<\/font><\/li>\n<li><font color=\"blue\">**\u573a\u666f\u5316\u9009\u578b\u6307\u5357**<\/font><\/li>\n<li><font color=\"blue\">**A\/B\u6d4b\u8bd5\u6846\u67b6**<\/font><\/li>\n<\/ul>\n<\/li>\n<li>\u56db\u3001\u8bed\u4e49\u4e22\u5931\u5de5\u4e1a\u7ea7\u89e3\u51b3\u65b9\u6848&#xff1a;\u201c\u9884\u5904\u7406&#043;\u7cbe\u51c6\u5316\u5207\u5206&#043;\u9ad8\u9636\u8865\u507f\u201d\u4e09\u5c42\u9632\u5fa1\u4f53\u7cfb<\/li>\n<li>\n<ul>\n<li>4.1 <font color=\"green\">**\u7b2c\u4e00\u5c42&#xff1a;\u9884\u5904\u7406\u2014\u2014\u7528\u5916\u90e8\u5de5\u5177\u5b9e\u73b0 \u201c\u7ed3\u6784\u8fd8\u539f\u201d**<\/font><\/li>\n<li>4.2 <font color=\"blue\">**\u7b2c\u4e8c\u5c42&#xff1a;\u5207\u5206\u2014\u2014LangChain\u7cbe\u7ec6\u5316\u914d\u7f6e\u5b9e\u73b0 \u201c\u7cbe\u51c6\u5207\u5206\u201d**<\/font><\/li>\n<li>4.3 <font color=\"orange\">**\u7b2c\u4e09\u5c42&#xff1a;\u8865\u507f\u2014\u2014\u68c0\u7d22\u7aef\u7684\u8bed\u4e49\u65ad\u88c2\u4fee\u590d**<\/font><\/li>\n<\/ul>\n<\/li>\n<li>\u4e94\u3001\u62d3\u5c55\u65b9\u6848&#xff1a;\u8d85\u8d8a\u57fa\u7840\u7684\u8bed\u4e49\u4fdd\u7559\u79d8\u7c4d<\/li>\n<li>\n<ul>\n<li><font color=\"red\">**\u62d3\u5c55\u65b9\u68481&#xff1a;\u591a\u6a21\u6001\u8bed\u4e49\u4fdd\u7559\u65b9\u6848**<\/font><\/li>\n<li><font color=\"blue\">**\u62d3\u5c55\u65b9\u68482&#xff1a;\u52a8\u6001\u8bed\u4e49\u8c03\u6574\u65b9\u6848**<\/font><\/li>\n<li><font color=\"green\">**\u62d3\u5c55\u65b9\u68483&#xff1a;\u8de8\u6587\u6863\u8bed\u4e49\u5173\u8054\u65b9\u6848**<\/font><\/li>\n<\/ul>\n<\/li>\n<li>\u516d\u3001\u5e38\u89c1\u95ee\u9898\u4e0e\u89e3\u51b3\u65b9\u6848<\/li>\n<li>\n<ul>\n<li>6.1 \u5982\u4f55\u5904\u7406\u65e0\u6362\u884c\u7684\u957f\u6587\u672c&#xff1f;<\/li>\n<li>\u6838\u5fc3\u6b65\u9aa4&#xff1a;<\/li>\n<li>6.2 \u5982\u4f55\u5904\u7406\u591a\u8bed\u8a00\u6df7\u5408\u6587\u672c&#xff1f;<\/li>\n<\/ul>\n<\/li>\n<li>\u4e03\u3001\u4e92\u52a8\u73af\u8282&#xff1a;\u4f60\u7684\u8bed\u4e49\u4fdd\u7559\u6311\u6218&#xff1f;<\/li>\n<li>\n<ul>\n<li>7.1 <font color=\"purple\">**\u4e92\u52a8\u5f15\u5bfc**<\/font><\/li>\n<li>7.2 <font color=\"green\">**\u8f6c\u8f7d\u58f0\u660e**<\/font><\/li>\n<\/ul>\n<\/li>\n<\/ul>\n<\/li>\n<\/ul>\n<h3>\u4e00\u3001\u8bed\u4e49\u4e22\u5931&#xff1a;RAG\u7cfb\u7edf\u7684\u201c\u9690\u5f62\u6740\u624b\u201d<\/h3>\n<h4>1.1 \u4ec0\u4e48\u662f\u8bed\u4e49\u4e22\u5931&#xff1f;\u2014\u2014\u4e0d\u4ec5\u4ec5\u662f\u201c\u4fe1\u606f\u88ab\u5207\u65ad\u201d<\/h4>\n<p>\u5728\u6784\u5efaRAG&#xff08;\u68c0\u7d22\u589e\u5f3a\u751f\u6210&#xff09;\u7cfb\u7edf\u65f6&#xff0c;\u8bed\u4e49\u4e22\u5931\u662f\u4e00\u4e2a\u666e\u904d\u5b58\u5728\u4f46\u5e38\u88ab\u4f4e\u4f30\u7684\u6838\u5fc3\u95ee\u9898\u3002\u4e3a\u9002\u914d\u5411\u91cf\u6570\u636e\u5e93\u5b58\u50a8\u548cLLM\u8f93\u5165\u9650\u5236&#xff0c;\u6587\u6863\u5fc5\u987b\u5206\u5272\u4e3a\u4e00\u4e2a\u4e00\u4e2a\u5c0f\u5757&#xff08;Chunk&#xff09;&#xff0c;\u5206\u5272\u8fc7\u7a0b\u4e2d&#xff0c;\u5f88\u5bb9\u6613\u51fa\u73b0\u8bed\u4e49\u4e22\u5931\u3002<\/p>\n<p><font color=\"red\">\u8bed\u4e49\u4e22\u5931\u73b0\u8c61&#xff0c;\u5e76\u975e\u7b80\u5355\u7684\u201c\u4fe1\u606f\u7247\u6bb5\u7f3a\u5931\u201d&#xff0c;\u800c\u662f\u6307\u6587\u6863\u5728\u5206\u5272\u4e3a\u5c0f\u5757&#xff08;Chunk&#xff09;\u540e&#xff0c;\u539f\u59cb\u6587\u6863\u7684\u903b\u8f91\u8fde\u8d2f\u6027\u3001\u4e0a\u4e0b\u6587\u5173\u8054\u5173\u7cfb\u3001\u5b8c\u6574\u8bed\u4e49\u5355\u5143\u906d\u5230\u7ed3\u6784\u6027\u7834\u574f\u7684\u73b0\u8c61\u3002<\/font><\/p>\n<p>\u8bed\u4e49\u4e22\u5931\u5371\u5bb3\u5728\u4e8e&#xff1a;\u5373\u4fbf\u68c0\u7d22\u7cfb\u7edf\u80fd\u7cbe\u51c6\u5339\u914d\u5230\u76f8\u5173Chunk&#xff0c;LLM\u4e5f\u65e0\u6cd5\u57fa\u4e8e\u788e\u7247\u5316\u7684\u7247\u6bb5\u8fd8\u539f\u539f\u59cb\u8bed\u4e49\u903b\u8f91&#xff0c;\u6700\u7ec8\u5bfc\u81f4\u56de\u7b54\u4e0d\u5b8c\u6574\u3001\u4e0d\u51c6\u786e\u751a\u81f3\u5b8c\u5168\u504f\u79bb\u539f\u610f\u3002\u8fd9\u79cd\u201c\u8bed\u4e49\u7ed3\u6784\u5d29\u584c\u201d\u662fRAG\u7cfb\u7edf\u843d\u5730\u7684\u6838\u5fc3\u969c\u788d\u4e4b\u4e00\u3002<\/p>\n<p><img decoding=\"async\" src=\"https:\/\/www.wsisp.com\/helps\/wp-content\/uploads\/2026\/02\/20260224201854-699e07ae23512.jpg\" alt=\"\u6df7\u4e71\u4ee3\u7801\u56f0\u60d1.jpg\" \/><\/p>\n<h4>1.2 \u8bed\u4e49\u4e22\u5931\u7684\u6839\u6e90\u63a2\u7a76&#xff08;\u5168\u94fe\u8def\u62c6\u89e3&#xff09;<\/h4>\n<p>\u8bed\u4e49\u4e22\u5931\u5e76\u975e\u5355\u4e00\u73af\u8282\u5bfc\u81f4\u7684\u95ee\u9898&#xff0c;\u800c\u662fRAG\u5168\u6d41\u7a0b\u4e2d\u201c\u6587\u672c\u8868\u793a\u5931\u771f\u201d\u7684\u94fe\u5f0f\u53cd\u5e94\u7ed3\u679c\u3002\u5f88\u591a\u4eba\u8bef\u4ee5\u4e3a\u662fTextSplitter\u7684\u8bbe\u8ba1\u7f3a\u9677&#xff0c;\u4f46\u672c\u8d28\u662f\u6574\u4e2a\u6d41\u7a0b\u4e2d\u201c\u7ed3\u6784\u5316\u4fe1\u606f\u2192\u6241\u5e73\u6587\u672c\u2192\u788e\u7247\u5316Chunk\u201d\u7684\u4fe1\u606f\u635f\u8017\u7d2f\u79ef\u3002<\/p>\n<p>\u5177\u4f53\u53ef\u62c6\u89e3\u4e3a\u56db\u4e2a\u6838\u5fc3\u5c42\u6b21&#xff1a;<\/p>\n<table>\n<tr><font color=\"red\">\u5c42\u6b21<\/font><font color=\"red\">\u95ee\u9898\u672c\u8d28<\/font><font color=\"red\">\u5177\u4f53\u8868\u73b0<\/font><font color=\"red\">\u5f71\u54cd\u6df1\u5ea6<\/font><\/tr>\n<tbody>\n<tr>\n<td>\u6587\u672c\u8868\u793a\u5c42<\/td>\n<td>Loader\u5c06\u7ed3\u6784\u5316\/\u534a\u7ed3\u6784\u5316\u6587\u6863&#xff08;PDF\u3001HTML\u3001Word&#xff09;\u6241\u5e73\u5316\u4e3a\u7eaf\u6587\u672c<\/td>\n<td>\u8868\u683c\u53d8\u7ebf\u6027\u6587\u672c\u3001\u591a\u680f\u5e03\u5c40\u6df7\u4e71\u3001\u5b57\u4f53\/\u989c\u8272\u7b49\u8bed\u4e49\u4fe1\u606f\u4e22\u5931<\/td>\n<td>\u9ad8 &#8211; \u539f\u59cb\u7ed3\u6784\u6c38\u4e45\u4e22\u5931<\/td>\n<\/tr>\n<tr>\n<td>\u8bed\u8a00\u7406\u89e3\u5c42<\/td>\n<td>Splitter\u65e0\u6cd5\u7406\u89e3\u81ea\u7136\u8bed\u8a00\u7684\u8bed\u6cd5\u548c\u8bed\u4e49\u7ed3\u6784<\/td>\n<td>\u5728\u53e5\u5b50\u4e2d\u95f4\u3001\u77ed\u8bed\u4e2d\u95f4\u3001\u672f\u8bed\u4e2d\u95f4\u5207\u65ad<\/td>\n<td>\u9ad8 &#8211; \u7834\u574f\u8bed\u8a00\u8fde\u8d2f\u6027<\/td>\n<\/tr>\n<tr>\n<td>\u4e1a\u52a1\u903b\u8f91\u5c42<\/td>\n<td>\u65e0\u6cd5\u8bc6\u522b\u6587\u6863\u7684\u4e1a\u52a1\u8bed\u4e49\u5355\u5143<\/td>\n<td>\u5408\u540c\u6761\u6b3e\u3001\u4ee3\u7801\u5757\u3001\u6570\u5b66\u516c\u5f0f\u3001\u53c2\u8003\u6587\u732e\u88ab\u5272\u88c2<\/td>\n<td>\u4e2d\u9ad8 &#8211; \u4e1a\u52a1\u542b\u4e49\u53d7\u635f<\/td>\n<\/tr>\n<tr>\n<td>\u68c0\u7d22\u8865\u507f\u5c42<\/td>\n<td>\u68c0\u7d22\u65f6\u7f3a\u4e4f\u8db3\u591f\u7684\u4e0a\u4e0b\u6587\u91cd\u5efa\u80fd\u529b<\/td>\n<td>chunk\u4e4b\u95f4\u5173\u8054\u4e22\u5931&#xff0c;LLM\u83b7\u5f97\u788e\u7247\u5316\u4fe1\u606f<\/td>\n<td>\u4e2d &#8211; \u53ef\u90e8\u5206\u901a\u8fc7\u6280\u672f\u5f25\u8865<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<h4>\u5173\u952e\u8ba4\u77e5\u66f4\u65b0&#xff1a;<\/h4>\n<p><font color=\"green\">\u8bed\u4e49\u4e22\u5931\u4e0d\u662f\u5355\u4e00\u73af\u8282\u7684\u95ee\u9898&#xff0c;\u800c\u662f\u9884\u5904\u7406\u2192\u52a0\u8f7d\u2192\u5206\u5272\u2192\u68c0\u7d22\u2192\u751f\u6210\u5168\u94fe\u8def\u7684\u7cfb\u7edf\u6027\u95ee\u9898\u3002\u5355\u7eaf\u4f18\u5316TextSplitter\u53ea\u80fd\u7f13\u89e3\u75c7\u72b6&#xff0c;\u4e0d\u80fd\u6839\u6cbb\u75be\u75c5\u3002\u6211\u4eec\u9700\u8981\u7684\u662f&#034;\u7aef\u5230\u7aef&#034;\u7684\u8bed\u4e49\u4fdd\u7559\u7b56\u7565\u3002<\/font><\/p>\n<p><img decoding=\"async\" src=\"https:\/\/www.wsisp.com\/helps\/wp-content\/uploads\/2026\/02\/20260224201856-699e07b01137e.jpg\" alt=\"\u56db\u5c42\u903b\u8f91\u4ef0\u671b.jpg\" \/><\/p>\n<h3>\u4e8c\u3001LangChain TextSplitter\u6df1\u5ea6\u89e3\u6790\u4e0e\u6700\u4f73\u5b9e\u8df5<\/h3>\n<h4>2.1 \u4e2d\u6587\u4f18\u5316\u7684RecursiveCharacterTextSplitter<\/h4>\n<p>RecursiveCharacterTextSplitter\u662fLangChain\u4e2d\u6700\u5e38\u7528\u7684\u57fa\u7840\u5206\u5757\u5668&#xff0c;\u5176\u6838\u5fc3\u4f18\u52bf\u662f\u901a\u8fc7\u201c\u9012\u5f52\u5c1d\u8bd5\u5206\u9694\u7b26\u201d\u5b9e\u73b0\u5206\u5c42\u5207\u5206&#xff0c;\u9002\u914d\u5927\u591a\u6570\u6587\u672c\u7c7b\u578b\u3002\u9488\u5bf9\u4e2d\u6587\u573a\u666f&#xff0c;\u6838\u5fc3\u4f18\u5316\u65b9\u5411\u662f\u6784\u5efa\u201c\u4e2d\u6587\u8bed\u4e49\u8fb9\u754c\u4f18\u5148\u201d\u7684\u5206\u9694\u7b26\u5217\u8868&#xff0c;\u540c\u65f6\u52a8\u6001\u8c03\u6574\u5757\u5927\u5c0f&#xff08;chunk_size&#xff09;\u548c\u91cd\u53e0\u7387&#xff08;chunk_overlap&#xff09;&#xff0c;\u51cf\u5c11\u8bed\u4e49\u65ad\u88c2\u3002<\/p>\n<p><font color=\"blue\">\u4e2d\u6587\u4f18\u5316\u7684\u6838\u5fc3\u601d\u8def&#xff1a;\u4e2d\u6587\u8bed\u4e49\u8fb9\u754c\u4e0e\u82f1\u6587\u5b58\u5728\u663e\u8457\u5dee\u5f02&#xff0c;\u9700\u4f18\u5148\u6309\u4e2d\u6587\u7279\u6709\u7684\u6807\u70b9\u7b26\u53f7\u548c\u6587\u672c\u7ed3\u6784\u5207\u5206&#xff0c;\u5177\u4f53\u4f18\u5148\u7ea7\u6392\u5e8f\u4e3a&#xff1a;\u6bb5\u843d\u5206\u9694&#xff08;\u7a7a\u884c&#xff09;&gt;\u6362\u884c &gt;\u53e5\u672b\u6807\u70b9&#xff08;\u3002&#xff01;&#xff1f;&#xff09;&gt;\u5206\u53e5\u6807\u70b9&#xff08;&#xff1b;&#xff09;&gt;\u77ed\u8bed\u5206\u9694&#xff08;&#xff0c;\u3001&#xff09;&gt;\u7a7a\u683c\u3002\u901a\u8fc7\u8fd9\u4e00\u4f18\u5148\u7ea7&#xff0c;\u6700\u5927\u5316\u4fdd\u8bc1\u53e5\u5b50\u3001\u77ed\u8bed\u7b49\u57fa\u7840\u8bed\u4e49\u5355\u5143\u7684\u5b8c\u6574\u6027\u3002<\/font><\/p>\n<h4>\u6838\u5fc3\u914d\u7f6e\u8bf4\u660e&#xff1a;<\/h4>\n<p>chunk_size&#xff1a;\u6839\u636e\u6587\u672c\u7c7b\u578b\u52a8\u6001\u8c03\u6574&#xff0c;\u4e2d\u6587\u573a\u666f\u5efa\u8bae\u53c2\u8003&#xff1a;\u77ed\u53e5\u591a\u3001\u8bed\u4e49\u5355\u5143\u5c0f\u7684\u6587\u672c&#xff08;\u5982\u65b0\u95fb\u3001\u535a\u5ba2&#xff09;300-500\u5b57\u7b26&#xff1b;\u6280\u672f\u6587\u6863&#xff08;API\/\u624b\u518c&#xff09;500-800\u5b57\u7b26&#xff08;\u957f\u53e5\u591a&#xff0c;\u672f\u8bed\u5bc6\u96c6&#xff09;&#xff1b;\u6cd5\u5f8b\u5408\u540c&#xff08;\u6761\u6b3e\/\u534f\u8bae&#xff09;800-1200\u5b57\u7b26&#xff08;\u6761\u6b3e\u5b8c\u6574\u5ea6\u8981\u6c42\u9ad8&#xff09;\u3002<\/p>\n<p>chunk_overlap&#xff1a;\u5757\u95f4\u91cd\u53e0\u5927\u5c0f&#xff0c;\u5efa\u8bae\u4e3achunk_size\u768410%-20%\u300210%\u9002\u7528\u4e8e\u8bed\u4e49\u5173\u8054\u6027\u5f31\u7684\u6587\u672c&#xff08;\u65e5\u5fd7\/\u65b0\u95fb&#xff09;&#xff1b;15%\u9002\u7528\u4e8e\u4e2d\u7b49\u5173\u8054\u6027\u6587\u672c&#xff08;\u6280\u672f\u6587\u6863&#xff09;&#xff1b;20%\u9002\u7528\u4e8e\u5f3a\u5173\u8054\u6027\u6587\u672c&#xff08;\u6cd5\u5f8b\/\u5b66\u672f&#xff09;\u3002<\/p>\n<p>separators&#xff1a;\u5206\u9694\u7b26\u5217\u8868&#xff0c;\u4e2d\u6587\u573a\u666f\u5efa\u8bae\u4f7f\u7528\u4f18\u5316\u540e\u7684\u5217\u8868&#xff0c;\u65e0\u9700\u989d\u5916\u914d\u7f6e\u65f6\u53ef\u4f7f\u7528\u4e2d\u6587\u4f18\u5316\u7248\u672c\u3002<\/p>\n<p>is_separator_regex&#xff1a;\u662f\u5426\u5c06\u5206\u9694\u7b26\u89c6\u4e3a\u6b63\u5219\u8868\u8fbe\u5f0f&#xff0c;\u7528\u4e8e\u590d\u6742\u8fb9\u754c\u5339\u914d&#xff08;\u5982\u6cd5\u5f8b\u6761\u6b3e\u3001\u7ae0\u8282\u6807\u9898\u7684\u7cbe\u51c6\u5339\u914d&#xff09;\u3002<\/p>\n<p>length_function&#xff1a;\u957f\u5ea6\u8ba1\u7b97\u51fd\u6570&#xff0c;\u9ed8\u8ba4\u4e3a\u5b57\u7b26\u6570&#xff1b;\u82e5\u9700Token\u8ba1\u6570&#xff0c;\u53ef\u4f20\u5165tiktoken\u8ba1\u6570\u51fd\u6570\u3002<\/p>\n<h4>\u5b9e\u6218Python\u4ee3\u7801\u793a\u4f8b<\/h4>\n<p><span class=\"token keyword\">from<\/span> langchain<span class=\"token punctuation\">.<\/span>text_splitter <span class=\"token keyword\">import<\/span> <span class=\"token operator\">&lt;<\/span>font color<span class=\"token operator\">&#061;<\/span><span class=\"token string\">&#034;blue&#034;<\/span><span class=\"token operator\">&gt;<\/span><span class=\"token operator\">**<\/span>RecursiveCharacterTextSplitter<span class=\"token operator\">**<\/span><span class=\"token operator\">&lt;<\/span><span class=\"token operator\">\/<\/span>font<span class=\"token operator\">&gt;<\/span><\/p>\n<p><span class=\"token comment\"># \u4e2d\u6587\u4f18\u5316\u7684\u5206\u9694\u7b26\u5217\u8868<\/span><br \/>\n<span class=\"token operator\">&lt;<\/span>font color<span class=\"token operator\">&#061;<\/span><span class=\"token string\">&#034;red&#034;<\/span><span class=\"token operator\">&gt;<\/span><span class=\"token operator\">**<\/span>CHINESE_SEPARATORS<span class=\"token operator\">**<\/span><span class=\"token operator\">&lt;<\/span><span class=\"token operator\">\/<\/span>font<span class=\"token operator\">&gt;<\/span> <span class=\"token operator\">&#061;<\/span> <span class=\"token punctuation\">[<\/span><span class=\"token string\">&#034;\\\\n\\\\n&#034;<\/span><span class=\"token punctuation\">,<\/span> <span class=\"token string\">&#034;\\\\n&#034;<\/span><span class=\"token punctuation\">,<\/span> <span class=\"token string\">&#034;\u3002&#034;<\/span><span class=\"token punctuation\">,<\/span> <span class=\"token string\">&#034;&#xff01;&#034;<\/span><span class=\"token punctuation\">,<\/span> <span class=\"token string\">&#034;&#xff1f;&#034;<\/span><span class=\"token punctuation\">,<\/span> <span class=\"token string\">&#034;&#xff1b;&#034;<\/span><span class=\"token punctuation\">,<\/span> <span class=\"token string\">&#034;&#xff0c;&#034;<\/span><span class=\"token punctuation\">,<\/span> <span class=\"token string\">&#034;\u3001&#034;<\/span><span class=\"token punctuation\">,<\/span> <span class=\"token string\">&#034; &#034;<\/span><span class=\"token punctuation\">,<\/span> <span class=\"token string\">&#034;&#034;<\/span><span class=\"token punctuation\">]<\/span><\/p>\n<p><span class=\"token comment\"># \u521d\u59cb\u5316\u4e2d\u6587\u4f18\u5316\u7684RecursiveCharacterTextSplitter<\/span><br \/>\ntext_splitter <span class=\"token operator\">&#061;<\/span> <span class=\"token operator\">&lt;<\/span>font color<span class=\"token operator\">&#061;<\/span><span class=\"token string\">&#034;blue&#034;<\/span><span class=\"token operator\">&gt;<\/span><span class=\"token operator\">**<\/span>RecursiveCharacterTextSplitter<span class=\"token operator\">**<\/span><span class=\"token operator\">&lt;<\/span><span class=\"token operator\">\/<\/span>font<span class=\"token operator\">&gt;<\/span><span class=\"token punctuation\">(<\/span><br \/>\n    separators<span class=\"token operator\">&#061;<\/span><span class=\"token operator\">&lt;<\/span>font color<span class=\"token operator\">&#061;<\/span><span class=\"token string\">&#034;red&#034;<\/span><span class=\"token operator\">&gt;<\/span><span class=\"token operator\">**<\/span>CHINESE_SEPARATORS<span class=\"token operator\">**<\/span><span class=\"token operator\">&lt;<\/span><span class=\"token operator\">\/<\/span>font<span class=\"token operator\">&gt;<\/span><span class=\"token punctuation\">,<\/span><br \/>\n    <span class=\"token operator\">&lt;<\/span>font color<span class=\"token operator\">&#061;<\/span><span class=\"token string\">&#034;orange&#034;<\/span><span class=\"token operator\">&gt;<\/span><span class=\"token operator\">**<\/span>chunk_size<span class=\"token operator\">**<\/span><span class=\"token operator\">&lt;<\/span><span class=\"token operator\">\/<\/span>font<span class=\"token operator\">&gt;&#061;<\/span><span class=\"token number\">600<\/span><span class=\"token punctuation\">,<\/span><br \/>\n    <span class=\"token operator\">&lt;<\/span>font color<span class=\"token operator\">&#061;<\/span><span class=\"token string\">&#034;purple&#034;<\/span><span class=\"token operator\">&gt;<\/span><span class=\"token operator\">**<\/span>chunk_overlap<span class=\"token operator\">**<\/span><span class=\"token operator\">&lt;<\/span><span class=\"token operator\">\/<\/span>font<span class=\"token operator\">&gt;&#061;<\/span><span class=\"token number\">100<\/span><span class=\"token punctuation\">,<\/span><br \/>\n    <span class=\"token operator\">&lt;<\/span>font color<span class=\"token operator\">&#061;<\/span><span class=\"token string\">&#034;blue&#034;<\/span><span class=\"token operator\">&gt;<\/span><span class=\"token operator\">**<\/span>length_function<span class=\"token operator\">**<\/span><span class=\"token operator\">&lt;<\/span><span class=\"token operator\">\/<\/span>font<span class=\"token operator\">&gt;&#061;<\/span><span class=\"token builtin\">len<\/span><span class=\"token punctuation\">,<\/span><br \/>\n    <span class=\"token operator\">&lt;<\/span>font color<span class=\"token operator\">&#061;<\/span><span class=\"token string\">&#034;orange&#034;<\/span><span class=\"token operator\">&gt;<\/span><span class=\"token operator\">**<\/span>is_separator_regex<span class=\"token operator\">**<\/span><span class=\"token operator\">&lt;<\/span><span class=\"token operator\">\/<\/span>font<span class=\"token operator\">&gt;&#061;<\/span><span class=\"token boolean\">False<\/span><br \/>\n<span class=\"token punctuation\">)<\/span><\/p>\n<p><span class=\"token comment\"># \u793a\u4f8b\u6587\u672c<\/span><br \/>\nsample_text <span class=\"token operator\">&#061;<\/span> <span class=\"token string\">&#034;\u5728\u6784\u5efaRAG\u7cfb\u7edf\u65f6&#xff0c;\u8bed\u4e49\u4e22\u5931\u662f\u4e00\u4e2a\u666e\u904d\u5b58\u5728\u4f46\u5e38\u88ab\u4f4e\u4f30\u7684\u6838\u5fc3\u95ee\u9898\u3002\u4e3a\u9002\u914d\u5411\u91cf\u6570\u636e\u5e93\u5b58\u50a8\u548cLLM\u8f93\u5165\u9650\u5236&#xff0c;\u6587\u6863\u5fc5\u987b\u5206\u5272\u4e3a\u4e00\u4e2a\u4e00\u4e2a\u5c0f\u5757&#xff08;Chunk&#xff09;&#xff0c;\u5206\u5272\u8fc7\u7a0b\u4e2d&#xff0c;\u5f88\u5bb9\u6613\u51fa\u73b0\u8bed\u4e49\u4e22\u5931\u3002&#034;<\/span><\/p>\n<p><span class=\"token comment\"># \u6267\u884c\u5206\u5272<\/span><br \/>\nsplit_texts <span class=\"token operator\">&#061;<\/span> text_splitter<span class=\"token punctuation\">.<\/span><span class=\"token operator\">&lt;<\/span>font color<span class=\"token operator\">&#061;<\/span><span class=\"token string\">&#034;green&#034;<\/span><span class=\"token operator\">&gt;<\/span><span class=\"token operator\">**<\/span>split_text<span class=\"token operator\">**<\/span><span class=\"token operator\">&lt;<\/span><span class=\"token operator\">\/<\/span>font<span class=\"token operator\">&gt;<\/span><span class=\"token punctuation\">(<\/span>sample_text<span class=\"token punctuation\">)<\/span><\/p>\n<p><span class=\"token comment\"># \u6253\u5370\u7ed3\u679c<\/span><br \/>\n<span class=\"token keyword\">for<\/span> i<span class=\"token punctuation\">,<\/span> text <span class=\"token keyword\">in<\/span> <span class=\"token builtin\">enumerate<\/span><span class=\"token punctuation\">(<\/span>split_texts<span class=\"token punctuation\">)<\/span><span class=\"token punctuation\">:<\/span><br \/>\n    <span class=\"token keyword\">print<\/span><span class=\"token punctuation\">(<\/span><span class=\"token string-interpolation\"><span class=\"token string\">f&#034;Chunk <\/span><span class=\"token interpolation\"><span class=\"token punctuation\">{<\/span>i<span class=\"token operator\">&#043;<\/span><span class=\"token number\">1<\/span><span class=\"token punctuation\">}<\/span><\/span><span class=\"token string\">:\\\\n<\/span><span class=\"token interpolation\"><span class=\"token punctuation\">{<\/span>text<span class=\"token punctuation\">}<\/span><\/span><span class=\"token string\">\\\\n&#034;<\/span><\/span><span class=\"token punctuation\">)<\/span><\/p>\n<p><img decoding=\"async\" src=\"https:\/\/www.wsisp.com\/helps\/wp-content\/uploads\/2026\/02\/20260224201857-699e07b1d6f3e.jpg\" alt=\"\u9f7f\u8f6e\u4f18\u5316\u6a21\u578b.jpg\" \/><\/p>\n<h3>\u4e09\u3001\u4e94\u5927\u5de5\u7a0b\u7b56\u7565&#xff1a;\u7aef\u5230\u7aef\u8bed\u4e49\u4fdd\u7559\u89e3\u51b3\u65b9\u6848<\/h3>\n<p>\u8bed\u4e49\u4e22\u5931\u662f\u5168\u94fe\u8def\u95ee\u9898&#xff0c;\u9700\u4ece\u201c\u5206\u5272\u524d\u3001\u5206\u5272\u4e2d\u3001\u5206\u5272\u540e\u201d\u4e09\u4e2a\u9636\u6bb5\u6784\u5efa\u89e3\u51b3\u65b9\u6848\u3002\u4ee5\u4e0b\u4e94\u5927\u7b56\u7565\u5c42\u5c42\u9012\u8fdb&#xff0c;\u4ece\u57fa\u7840\u4f18\u5316\u5230\u9ad8\u7ea7\u589e\u5f3a&#xff0c;\u8986\u76d6\u4ece\u4e2d\u5c0f\u89c4\u6a21\u5230\u5927\u89c4\u6a21\u751f\u4ea7\u73af\u5883\u7684\u9700\u6c42\u3002<\/p>\n<h4>3.1 <font color=\"red\">\u57fa\u7840\u7b56\u7565&#xff1a;\u4f18\u5148\u7ea7\u5206\u9694\u7b26 &#043; \u9012\u5f52\u5207\u5206&#xff08;\u5fc5\u9009&#xff09;<\/font><\/h4>\n<p>\u8fd9\u662f\u8bed\u4e49\u4fdd\u7559\u7684\u57fa\u7840&#xff0c;\u6838\u5fc3\u662f\u901a\u8fc7\u201c\u8bed\u4e49\u8fb9\u754c\u4f18\u5148\u7ea7\u6392\u5e8f\u201d\u8ba9\u5206\u5272\u5668\u5728\u81ea\u7136\u8fb9\u754c\u5207\u5272&#xff0c;\u907f\u514d\u786c\u5207\u5bfc\u81f4\u7684\u8bed\u4e49\u65ad\u88c2\u3002\u9002\u7528\u4e8e\u6240\u6709\u4e2d\u6587\u573a\u666f&#xff0c;\u662f\u540e\u7eed\u9ad8\u7ea7\u7b56\u7565\u7684\u57fa\u7840\u3002<\/p>\n<h4>3.2 <font color=\"orange\">\u8fdb\u9636\u7b56\u7565&#xff1a;\u91cd\u53e0\u7a97\u53e3&#xff08;\u8bed\u4e49\u8865\u507f\u6838\u5fc3&#xff09;<\/font><\/h4>\n<p>\u91cd\u53e0\u7a97\u53e3\u7684\u6838\u5fc3\u4f5c\u7528\u662f\u201c\u4e0a\u4e0b\u6587\u8865\u507f\u201d\u2014\u2014\u901a\u8fc7\u8ba9\u76f8\u90bbChunk\u4fdd\u7559\u90e8\u5206\u91cd\u590d\u5185\u5bb9&#xff0c;\u89e3\u51b3\u201c\u5fc5\u987b\u5207\u5272\u957f\u6587\u672c\u201d\u5bfc\u81f4\u7684\u8bed\u4e49\u65ad\u88c2\u3002\u4f46\u7b80\u5355\u7684\u56fa\u5b9a\u91cd\u53e0\u7387\u4f1a\u5bfc\u81f4\u4fe1\u606f\u91cd\u590d\u6216\u8865\u507f\u4e0d\u8db3&#xff0c;\u667a\u80fd\u91cd\u53e0\u9700\u6839\u636e\u6587\u672c\u7279\u5f81\u52a8\u6001\u8c03\u6574\u3002<\/p>\n<h4>3.3 <font color=\"orange\">\u8fdb\u9636\u7b56\u7565&#xff1a;\u81ea\u5b9a\u4e49\u8bed\u4e49Splitter\u2014\u2014\u4e2d\u6587\u573a\u666f\u7684\u7cbe\u51c6\u89e3\u51b3\u65b9\u6848<\/font><\/h4>\n<p>\u57fa\u7840\u7b56\u7565\u4e0e\u91cd\u53e0\u7a97\u53e3\u4ecd\u5b58\u5728\u5c40\u9650\u6027&#xff1a;\u65e0\u6cd5\u7cbe\u51c6\u8bc6\u522b\u4e2d\u6587\u7279\u6709\u7684\u8bed\u4e49\u8fb9\u754c&#xff08;\u5982\u957f\u53e5\u5185\u7684\u903b\u8f91\u505c\u987f\u3001\u6b67\u4e49\u53e5\u7684\u8bed\u4e49\u5355\u5143&#xff09;\u3002\u81ea\u5b9a\u4e49\u8bed\u4e49Splitter\u57fa\u4e8e\u4e2d\u6587\u5206\u8bcd\/\u53e5\u6cd5\u5206\u6790\u6a21\u578b&#xff08;\u5982HanLP\u3001Spacy&#xff09;&#xff0c;\u5b9e\u73b0\u201c\u53e5\u5b50\u7ea7\u7cbe\u51c6\u5206\u5272\u201d&#xff0c;\u4ece\u6839\u6e90\u4e0a\u907f\u514d\u8bed\u4e49\u5355\u5143\u88ab\u5207\u65ad\u3002<\/p>\n<h4>3.4 <font color=\"purple\">\u9ad8\u7ea7\u7b56\u7565&#xff1a;\u5143\u6570\u636e\u624b\u52a8\u8865\u5145\u2014\u2014\u68c0\u7d22\u7cbe\u5ea6\u589e\u5f3a\u65b9\u6848<\/font><\/h4>\n<p>\u8bed\u4e49\u4fdd\u7559\u4e0d\u4ec5\u9700\u8981\u201c\u5206\u5272\u65f6\u4fdd\u7559\u5b8c\u6574\u8bed\u4e49\u201d&#xff0c;\u8fd8\u9700\u8981\u201c\u68c0\u7d22\u65f6\u7cbe\u51c6\u5339\u914d\u8bed\u4e49\u201d\u3002\u5143\u6570\u636e\u8865\u5145\u901a\u8fc7\u4e3a\u6bcf\u4e2aChunk\u6dfb\u52a0\u201c\u4e1a\u52a1\u8bed\u4e49\u6807\u7b7e\u201d&#xff08;\u5982\u6587\u6863\u7c7b\u578b\u3001\u7ae0\u8282\u3001\u5173\u952e\u8bcd\u3001\u9875\u7801&#xff09;&#xff0c;\u8ba9\u68c0\u7d22\u7cfb\u7edf\u80fd\u901a\u8fc7\u5143\u6570\u636e\u8fc7\u6ee4\u65e0\u5173Chunk&#xff0c;\u7cbe\u51c6\u5b9a\u4f4d\u6838\u5fc3\u8bed\u4e49\u5185\u5bb9&#xff0c;\u907f\u514d\u56e0\u788e\u7247\u5316Chunk\u5bfc\u81f4\u7684\u68c0\u7d22\u504f\u5dee\u3002<\/p>\n<h4>3.5 <font color=\"purple\">\u9ad8\u7ea7\u7b56\u7565&#xff1a;\u7236\u6587\u6863\u68c0\u7d22\u2014\u2014\u957f\u6587\u6863\u8bed\u4e49\u4fdd\u7559\u7ec8\u6781\u65b9\u6848<\/font><\/h4>\n<p>\u5bf9\u4e8e\u4e07\u5b57\u5408\u540c\u3001\u767e\u9875\u8bba\u6587\u7b49\u8d85\u957f\u6587\u6863&#xff0c;\u5355\u4e00\u5206\u5757\u7b56\u7565\u96be\u4ee5\u5e73\u8861\u201c\u68c0\u7d22\u7cbe\u51c6\u5ea6\u201d\u548c\u201c\u4e0a\u4e0b\u6587\u5b8c\u6574\u6027\u201d&#xff1a;\u5c0f\u7c92\u5ea6\u5206\u5757\u68c0\u7d22\u7cbe\u51c6\u4f46\u8bed\u4e49\u788e\u7247\u5316&#xff0c;\u5927\u7c92\u5ea6\u5206\u5757\u4e0a\u4e0b\u6587\u5b8c\u6574\u4f46\u68c0\u7d22\u5197\u4f59\u3002\u7236\u6587\u6863\u68c0\u7d22\u91c7\u7528\u201c\u5b50\u5757\u68c0\u7d22 &#043; \u7236\u5757\u751f\u6210\u201d\u7684\u53cc\u5c42\u8bbe\u8ba1&#xff0c;\u5b8c\u7f8e\u89e3\u51b3\u8fd9\u4e00\u77db\u76fe\u3002<\/p>\n<p><font color=\"red\">\u4e94\u5927\u7b56\u7565\u6838\u5fc3\u603b\u7ed3&#xff1a;<\/font><\/p>\n<table>\n<tr>\u7b56\u7565\u8bed\u4e49\u5b8c\u6574\u6027\u68c0\u7d22\u7cbe\u5ea6\u5b9e\u73b0\u590d\u6742\u5ea6\u9002\u7528\u573a\u666f<\/tr>\n<tbody>\n<tr>\n<td><font color=\"red\">\u57fa\u7840\u7b56\u7565<\/font><\/td>\n<td>\u2605\u2605\u2605<\/td>\n<td>\u2605\u2605<\/td>\n<td>\u4f4e<\/td>\n<td>\u7b80\u5355\u6587\u672c\u3001\u65e5\u5fd7<\/td>\n<\/tr>\n<tr>\n<td><font color=\"orange\">\u91cd\u53e0\u7a97\u53e3<\/font><\/td>\n<td>\u2605\u2605\u2605\u2605<\/td>\n<td>\u2605\u2605\u2605<\/td>\n<td>\u4e2d\u4f4e<\/td>\n<td>\u4e2d\u7b49\u957f\u5ea6\u6587\u6863<\/td>\n<\/tr>\n<tr>\n<td><font color=\"green\">\u81ea\u5b9a\u4e49\u8bed\u4e49Splitter<\/font><\/td>\n<td>\u2605\u2605\u2605\u2605\u2605<\/td>\n<td>\u2605\u2605\u2605\u2605<\/td>\n<td>\u4e2d\u9ad8<\/td>\n<td>\u6cd5\u5f8b\u5408\u540c\u3001\u5b66\u672f\u8bba\u6587<\/td>\n<\/tr>\n<tr>\n<td><font color=\"blue\">\u5143\u6570\u636e\u8865\u5145<\/font><\/td>\n<td>\u2605\u2605\u2605<\/td>\n<td>\u2605\u2605\u2605\u2605\u2605<\/td>\n<td>\u4e2d<\/td>\n<td>\u7ed3\u6784\u5316\u6587\u6863\u6df7\u5408\u68c0\u7d22<\/td>\n<\/tr>\n<tr>\n<td><font color=\"purple\">\u7236\u6587\u6863\u68c0\u7d22<\/font><\/td>\n<td>\u2605\u2605\u2605\u2605\u2605<\/td>\n<td>\u2605\u2605\u2605\u2605\u2605<\/td>\n<td>\u9ad8<\/td>\n<td>\u8d85\u957f\u6587\u6863\u3001\u9ad8\u7cbe\u5ea6\u95ee\u7b54<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<p><img decoding=\"async\" src=\"https:\/\/www.wsisp.com\/helps\/wp-content\/uploads\/2026\/02\/20260224201859-699e07b3d25b2.jpg\" alt=\"\u7b56\u7565\u9636\u68af.jpg\" \/><\/p>\n<h4><font color=\"blue\">\u573a\u666f\u5316\u9009\u578b\u6307\u5357<\/font><\/h4>\n<li>\n<p><font color=\"red\">\u7b80\u5355\u6587\u6863&#xff08;\u65e5\u5fd7\u3001\u65b0\u95fb&#xff09;<\/font>&#xff1a;\u57fa\u7840\u7b56\u7565 &#043; \u91cd\u53e0\u7a97\u53e3&#xff0c;\u517c\u987e\u6548\u7387\u4e0e\u57fa\u7840\u8bed\u4e49\u4fdd\u7559\u3002<\/p>\n<\/li>\n<li>\n<p><font color=\"blue\">\u6280\u672f \/\u4ee3\u7801\u6587\u6863<\/font>&#xff1a;\u57fa\u7840\u7b56\u7565 &#043; \u81ea\u5b9a\u4e49\u5206\u9694\u7b26 &#043; \u91cd\u53e0\u7a97\u53e3&#xff0c;\u4fdd\u62a4\u4ee3\u7801\u5757\u5b8c\u6574\u6027\u3002<\/p>\n<\/li>\n<li>\n<p><font color=\"green\">\u6cd5\u5f8b \/\u653f\u7b56\u6587\u4ef6<\/font>&#xff1a;\u81ea\u5b9a\u4e49\u8bed\u4e49 Splitter &#043; \u5143\u6570\u636e\u8865\u5145&#xff0c;\u5b9e\u73b0\u7cbe\u51c6\u5206\u5272\u4e0e\u68c0\u7d22\u3002<\/p>\n<\/li>\n<li>\n<p><font color=\"orange\">\u5b66\u672f\u8bba\u6587<\/font>&#xff1a;\u7236\u6587\u6863\u68c0\u7d22 &#043; \u81ea\u5b9a\u4e49\u8bed\u4e49 Splitter&#xff0c;\u5e73\u8861\u68c0\u7d22\u7cbe\u5ea6\u4e0e\u4e0a\u4e0b\u6587\u5b8c\u6574\u6027\u3002<\/p>\n<\/li>\n<li>\n<p><font color=\"purple\">\u5927\u89c4\u6a21\u751f\u4ea7\u7cfb\u7edf<\/font>&#xff1a;\u5206\u5c42\u52a8\u6001\u7b56\u7565&#xff0c;\u6309\u6587\u6863\u7c7b\u578b\u81ea\u52a8\u5339\u914d\u5206\u5272\u65b9\u6848\u3002<\/p>\n<\/li>\n<h4><font color=\"blue\">A\/B\u6d4b\u8bd5\u6846\u67b6<\/font><\/h4>\n<p>A\/B\u6d4b\u8bd5\u7528\u4e8e\u6bd4\u8f83\u4e0d\u540c\u5206\u5272\u7b56\u7565\u7684\u6548\u679c&#xff0c;\u6838\u5fc3\u6b65\u9aa4\u5982\u4e0b&#xff1a;<\/p>\n<li>\n<p><font color=\"red\">\u51c6\u5907\u6d4b\u8bd5\u6570\u636e\u96c6<\/font>&#xff1a;\u4e00\u7ec4\u6587\u6863\u548c\u5bf9\u5e94\u7684\u95ee\u9898-\u7b54\u6848\u5bf9&#xff08;\u9700\u8db3\u591f\u6837\u672c\u91cf\u4fdd\u8bc1\u7ed3\u679c\u53ef\u9760\u6027&#xff09;&#xff1b;<\/p>\n<\/li>\n<li>\n<p><font color=\"blue\">\u6784\u5efa\u6d4b\u8bd5\u73af\u5883<\/font>&#xff1a;\u4f7f\u7528\u4e24\u79cd\u5206\u5272\u7b56\u7565\u5206\u522b\u5904\u7406\u6587\u6863&#xff0c;\u6784\u5efa\u4e24\u4e2a\u5411\u91cf\u6570\u636e\u5e93&#xff08;\u5176\u4ed6\u7ec4\u4ef6\u5982\u5d4c\u5165\u6a21\u578b\u3001\u68c0\u7d22\u5668\u3001LLM\u4fdd\u6301\u4e00\u81f4&#xff0c;\u63a7\u5236\u53d8\u91cf&#xff09;&#xff1b;<\/p>\n<\/li>\n<li>\n<p><font color=\"green\">\u6267\u884c\u6d4b\u8bd5<\/font>&#xff1a;\u5bf9\u6bcf\u4e2a\u95ee\u9898&#xff0c;\u5206\u522b\u83b7\u53d6\u4e24\u79cd\u7b56\u7565\u4e0b\u7684\u6a21\u578b\u7b54\u6848&#xff1b;<\/p>\n<\/li>\n<li>\n<p><font color=\"orange\">\u8bc4\u4f30\u6548\u679c<\/font>&#xff1a;\u901a\u8fc7\u81ea\u52a8\u8bc4\u4f30\u6307\u6807&#xff08;\u5982\u4e0e\u6807\u51c6\u7b54\u6848\u7684\u76f8\u4f3c\u5ea6&#xff09;\u6216\u4eba\u5de5\u8bc4\u4f30&#xff08;\u5982\u8bed\u4e49\u5b8c\u6574\u6027\u3001\u56de\u7b54\u51c6\u786e\u6027&#xff09;\u6253\u5206&#xff1b;<\/p>\n<\/li>\n<li>\n<p><font color=\"purple\">\u5206\u6790\u7ed3\u679c<\/font>&#xff1a;\u5bf9\u6bd4\u4e24\u79cd\u7b56\u7565\u7684\u5f97\u5206&#xff0c;\u9009\u62e9\u6548\u679c\u66f4\u4f18\u7684\u65b9\u6848\u3002<\/p>\n<\/li>\n<h3>\u56db\u3001\u8bed\u4e49\u4e22\u5931\u5de5\u4e1a\u7ea7\u89e3\u51b3\u65b9\u6848&#xff1a;\u201c\u9884\u5904\u7406&#043;\u7cbe\u51c6\u5316\u5207\u5206&#043;\u9ad8\u9636\u8865\u507f\u201d\u4e09\u5c42\u9632\u5fa1\u4f53\u7cfb<\/h3>\n<p>\u4e09\u5c42\u9632\u5fa1\u4f53\u7cfb\u7684\u6838\u5fc3\u903b\u8f91&#xff1a;\u4ece\u201c\u7ed3\u6784\u8fd8\u539f\u201d\u5230\u201c\u7cbe\u51c6\u5207\u5206\u201d\u518d\u5230\u201c\u8bed\u4e49\u8865\u507f\u201d&#xff0c;\u5c42\u5c42\u9012\u8fdb\u89e3\u51b3\u6587\u672c\u788e\u7247\u5316\u95ee\u9898&#xff0c;\u786e\u4fdd\u8f93\u5165\u5927\u6a21\u578b\u7684\u6587\u672c\u4fe1\u606f\u5b8c\u6574\u3001\u5173\u8054\u3001\u7cbe\u51c6&#xff0c;\u4e3aA\/B\u6d4b\u8bd5\u63d0\u4f9b\u9ad8\u8d28\u91cf\u7684\u6570\u636e\u57fa\u7840\u3002<\/p>\n<h4>4.1 <font color=\"green\">\u7b2c\u4e00\u5c42&#xff1a;\u9884\u5904\u7406\u2014\u2014\u7528\u5916\u90e8\u5de5\u5177\u5b9e\u73b0 \u201c\u7ed3\u6784\u8fd8\u539f\u201d<\/font><\/h4>\n<p><font color=\"blue\">\u6838\u5fc3\u76ee\u6807<\/font>&#xff1a;\u89e3\u51b3PDF\u3001Excel\u3001Word\u7b49\u975e\u7ed3\u6784\u5316\/\u534a\u7ed3\u6784\u5316\u6587\u6863\u7684\u7ed3\u6784\u4e22\u5931\u95ee\u9898\u3002\u539f\u59cb\u6587\u6863\u7684\u8868\u683c\u3001\u591a\u680f\u3001\u6807\u9898\u5c42\u7ea7\u7b49\u7ed3\u6784\u4fe1\u606f\u82e5\u76f4\u63a5\u4e22\u5f03&#xff0c;\u540e\u7eed\u5207\u5206\u5fc5\u7136\u5bfc\u81f4\u8bed\u4e49\u65ad\u88c2\u3002\u672c\u5c42\u901a\u8fc7\u4e13\u4e1a\u5de5\u5177\u63d0\u53d6\u7ed3\u6784\u7279\u5f81\u5e76\u8f6c\u5316\u4e3a\u7ed3\u6784\u5316\u6587\u672c&#xff0c;\u4e3a\u540e\u7eed\u5207\u5206\u5960\u5b9a\u57fa\u7840\u3002<\/p>\n<h4>4.2 <font color=\"blue\">\u7b2c\u4e8c\u5c42&#xff1a;\u5207\u5206\u2014\u2014LangChain\u7cbe\u7ec6\u5316\u914d\u7f6e\u5b9e\u73b0 \u201c\u7cbe\u51c6\u5207\u5206\u201d<\/font><\/h4>\n<p><font color=\"green\">\u6838\u5fc3\u76ee\u6807<\/font>&#xff1a;\u89e3\u51b3\u201c\u4e00\u5200\u5207\u201d\u5207\u5206\u5bfc\u81f4\u7684\u8bed\u4e49\u65ad\u88c2\u95ee\u9898\u3002\u4e0d\u540c\u6587\u672c\u7c7b\u578b&#xff08;\u6cd5\u5f8b\u5408\u540c\/\u65b0\u95fb\/\u4ee3\u7801&#xff09;\u3001\u4e0d\u540c\u5927\u6a21\u578b\u7684\u4e0a\u4e0b\u6587\u7a97\u53e3\u5927\u5c0f\u548c\u8bed\u4e49\u7406\u89e3\u80fd\u529b\u5b58\u5728\u5dee\u5f02&#xff0c;\u9700\u52a8\u6001\u914d\u7f6e\u5207\u5206\u53c2\u6570&#xff0c;\u786e\u4fdd\u5207\u5206\u540e\u7684\u6587\u672c\u7247\u6bb5\u8bed\u4e49\u5b8c\u6574\u3001\u9002\u914d\u6a21\u578b\u80fd\u529b\u3002<\/p>\n<h4>4.3 <font color=\"orange\">\u7b2c\u4e09\u5c42&#xff1a;\u8865\u507f\u2014\u2014\u68c0\u7d22\u7aef\u7684\u8bed\u4e49\u65ad\u88c2\u4fee\u590d<\/font><\/h4>\n<p><font color=\"purple\">\u6838\u5fc3\u76ee\u6807<\/font>&#xff1a;\u89e3\u51b3\u5207\u5206\u540e\u6587\u672c\u7247\u6bb5\u7684\u8bed\u4e49\u65ad\u88c2\u95ee\u9898\u3002\u5373\u4f7f\u7ecf\u8fc7\u7cbe\u7ec6\u5316\u5207\u5206&#xff0c;\u5355\u4e00\u7247\u6bb5\u4ecd\u53ef\u80fd\u7f3a\u5931\u5173\u952e\u4e0a\u4e0b\u6587&#xff08;\u5982\u6761\u6b3e\u7684\u524d\u7f6e\u6761\u4ef6\u3001\u6570\u636e\u7684\u4e1a\u52a1\u80cc\u666f&#xff09;\u3002\u672c\u5c42\u901a\u8fc7\u201c\u5143\u6570\u636e&#043;\u7236\u6587\u6863\u68c0\u7d22\u201d\u201c\u8bed\u4e49\u589e\u5f3a\u68c0\u7d22\u201d\u7b49\u7b56\u7565&#xff0c;\u53ec\u56de\u76f8\u5173\u7247\u6bb5\u7684\u5b8c\u6574\u4e0a\u4e0b\u6587&#xff0c;\u4e3a\u5927\u6a21\u578b\u63d0\u4f9b\u5168\u9762\u7684\u4fe1\u606f\u652f\u6491\u3002<\/p>\n<p>\u4e09\u5c42\u9632\u5fa1\u4f53\u7cfb\u5b9e\u6218Python\u4ee3\u7801\u793a\u4f8b&#xff1a;<\/p>\n<p><span class=\"token keyword\">from<\/span> langchain<span class=\"token punctuation\">.<\/span>document_loaders <span class=\"token keyword\">import<\/span> <span class=\"token operator\">&lt;<\/span>font color<span class=\"token operator\">&#061;<\/span><span class=\"token string\">&#034;red&#034;<\/span><span class=\"token operator\">&gt;<\/span><span class=\"token operator\">**<\/span>PyPDFLoader<span class=\"token operator\">**<\/span><span class=\"token operator\">&lt;<\/span><span class=\"token operator\">\/<\/span>font<span class=\"token operator\">&gt;<\/span><br \/>\n<span class=\"token keyword\">from<\/span> langchain<span class=\"token punctuation\">.<\/span>text_splitter <span class=\"token keyword\">import<\/span> <span class=\"token operator\">&lt;<\/span>font color<span class=\"token operator\">&#061;<\/span><span class=\"token string\">&#034;blue&#034;<\/span><span class=\"token operator\">&gt;<\/span><span class=\"token operator\">**<\/span>RecursiveCharacterTextSplitter<span class=\"token operator\">**<\/span><span class=\"token operator\">&lt;<\/span><span class=\"token operator\">\/<\/span>font<span class=\"token operator\">&gt;<\/span><br \/>\n<span class=\"token keyword\">from<\/span> langchain<span class=\"token punctuation\">.<\/span>vectorstores <span class=\"token keyword\">import<\/span> <span class=\"token operator\">&lt;<\/span>font color<span class=\"token operator\">&#061;<\/span><span class=\"token string\">&#034;green&#034;<\/span><span class=\"token operator\">&gt;<\/span><span class=\"token operator\">**<\/span>Chroma<span class=\"token operator\">**<\/span><span class=\"token operator\">&lt;<\/span><span class=\"token operator\">\/<\/span>font<span class=\"token operator\">&gt;<\/span><br \/>\n<span class=\"token keyword\">from<\/span> langchain<span class=\"token punctuation\">.<\/span>embeddings <span class=\"token keyword\">import<\/span> <span class=\"token operator\">&lt;<\/span>font color<span class=\"token operator\">&#061;<\/span><span class=\"token string\">&#034;orange&#034;<\/span><span class=\"token operator\">&gt;<\/span><span class=\"token operator\">**<\/span>OpenAIEmbeddings<span class=\"token operator\">**<\/span><span class=\"token operator\">&lt;<\/span><span class=\"token operator\">\/<\/span>font<span class=\"token operator\">&gt;<\/span><\/p>\n<p><span class=\"token comment\"># \u52a0\u8f7dPDF\u6587\u6863<\/span><br \/>\nloader <span class=\"token operator\">&#061;<\/span> PyPDFLoader<span class=\"token punctuation\">(<\/span><span class=\"token string\">&#034;LangChain\u5f7b\u5e95\u89e3\u51b3\u8bed\u4e49\u4fdd\u7559\u4e09\u677f\u65a7&#xff1a;\u4ece\u539f\u7406\u5230\u5b9e\u8df5\u7684\u5b8c\u6574\u89e3\u51b3\u65b9\u6848.pdf&#034;<\/span><span class=\"token punctuation\">)<\/span><br \/>\npages <span class=\"token operator\">&#061;<\/span> loader<span class=\"token punctuation\">.<\/span>load_and_split<span class=\"token punctuation\">(<\/span><span class=\"token punctuation\">)<\/span><\/p>\n<p><span class=\"token comment\"># \u521d\u59cb\u5316\u4e2d\u6587\u4f18\u5316\u7684RecursiveCharacterTextSplitter<\/span><br \/>\ntext_splitter <span class=\"token operator\">&#061;<\/span> RecursiveCharacterTextSplitter<span class=\"token punctuation\">(<\/span><br \/>\n    separators<span class=\"token operator\">&#061;<\/span><span class=\"token punctuation\">[<\/span><span class=\"token string\">&#034;\\\\n\\\\n&#034;<\/span><span class=\"token punctuation\">,<\/span> <span class=\"token string\">&#034;\\\\n&#034;<\/span><span class=\"token punctuation\">,<\/span> <span class=\"token string\">&#034;\u3002&#034;<\/span><span class=\"token punctuation\">,<\/span> <span class=\"token string\">&#034;&#xff01;&#034;<\/span><span class=\"token punctuation\">,<\/span> <span class=\"token string\">&#034;&#xff1f;&#034;<\/span><span class=\"token punctuation\">,<\/span> <span class=\"token string\">&#034;&#xff1b;&#034;<\/span><span class=\"token punctuation\">,<\/span> <span class=\"token string\">&#034;&#xff0c;&#034;<\/span><span class=\"token punctuation\">,<\/span> <span class=\"token string\">&#034;\u3001&#034;<\/span><span class=\"token punctuation\">,<\/span> <span class=\"token string\">&#034; &#034;<\/span><span class=\"token punctuation\">,<\/span> <span class=\"token string\">&#034;&#034;<\/span><span class=\"token punctuation\">]<\/span><span class=\"token punctuation\">,<\/span><br \/>\n    chunk_size<span class=\"token operator\">&#061;<\/span><span class=\"token number\">800<\/span><span class=\"token punctuation\">,<\/span><br \/>\n    chunk_overlap<span class=\"token operator\">&#061;<\/span><span class=\"token number\">100<\/span><span class=\"token punctuation\">,<\/span><br \/>\n    length_function<span class=\"token operator\">&#061;<\/span><span class=\"token builtin\">len<\/span><span class=\"token punctuation\">,<\/span><br \/>\n    is_separator_regex<span class=\"token operator\">&#061;<\/span><span class=\"token boolean\">False<\/span><br \/>\n<span class=\"token punctuation\">)<\/span><\/p>\n<p><span class=\"token comment\"># \u5206\u5272\u6587\u6863<\/span><br \/>\nsplit_docs <span class=\"token operator\">&#061;<\/span> text_splitter<span class=\"token punctuation\">.<\/span><span class=\"token operator\">&lt;<\/span>font color<span class=\"token operator\">&#061;<\/span><span class=\"token string\">&#034;green&#034;<\/span><span class=\"token operator\">&gt;<\/span><span class=\"token operator\">**<\/span>split_documents<span class=\"token operator\">**<\/span><span class=\"token operator\">&lt;<\/span><span class=\"token operator\">\/<\/span>font<span class=\"token operator\">&gt;<\/span><span class=\"token punctuation\">(<\/span>pages<span class=\"token punctuation\">)<\/span><\/p>\n<p><span class=\"token comment\"># \u521b\u5efa\u5411\u91cf\u6570\u636e\u5e93<\/span><br \/>\nembeddings <span class=\"token operator\">&#061;<\/span> OpenAIEmbeddings<span class=\"token punctuation\">(<\/span><span class=\"token punctuation\">)<\/span><br \/>\ndb <span class=\"token operator\">&#061;<\/span> <span class=\"token operator\">&lt;<\/span>font color<span class=\"token operator\">&#061;<\/span><span class=\"token string\">&#034;green&#034;<\/span><span class=\"token operator\">&gt;<\/span><span class=\"token operator\">**<\/span>Chroma<span class=\"token operator\">**<\/span><span class=\"token operator\">&lt;<\/span><span class=\"token operator\">\/<\/span>font<span class=\"token operator\">&gt;<\/span><span class=\"token punctuation\">.<\/span><span class=\"token operator\">&lt;<\/span>font color<span class=\"token operator\">&#061;<\/span><span class=\"token string\">&#034;purple&#034;<\/span><span class=\"token operator\">&gt;<\/span><span class=\"token operator\">**<\/span>from_documents<span class=\"token operator\">**<\/span><span class=\"token operator\">&lt;<\/span><span class=\"token operator\">\/<\/span>font<span class=\"token operator\">&gt;<\/span><span class=\"token punctuation\">(<\/span>split_docs<span class=\"token punctuation\">,<\/span> embeddings<span class=\"token punctuation\">)<\/span><\/p>\n<p><span class=\"token comment\"># \u793a\u4f8b\u68c0\u7d22<\/span><br \/>\nquery <span class=\"token operator\">&#061;<\/span> <span class=\"token string\">&#034;\u4ec0\u4e48\u662f\u8bed\u4e49\u4e22\u5931&#xff1f;&#034;<\/span><br \/>\ndocs <span class=\"token operator\">&#061;<\/span> db<span class=\"token punctuation\">.<\/span><span class=\"token operator\">&lt;<\/span>font color<span class=\"token operator\">&#061;<\/span><span class=\"token string\">&#034;purple&#034;<\/span><span class=\"token operator\">&gt;<\/span><span class=\"token operator\">**<\/span>similarity_search<span class=\"token operator\">**<\/span><span class=\"token operator\">&lt;<\/span><span class=\"token operator\">\/<\/span>font<span class=\"token operator\">&gt;<\/span><span class=\"token punctuation\">(<\/span>query<span class=\"token punctuation\">)<\/span><\/p>\n<p><span class=\"token comment\"># \u6253\u5370\u68c0\u7d22\u7ed3\u679c<\/span><br \/>\n<span class=\"token keyword\">print<\/span><span class=\"token punctuation\">(<\/span><span class=\"token string-interpolation\"><span class=\"token string\">f&#034;\u68c0\u7d22\u5230 <\/span><span class=\"token interpolation\"><span class=\"token punctuation\">{<\/span><span class=\"token builtin\">len<\/span><span class=\"token punctuation\">(<\/span>docs<span class=\"token punctuation\">)<\/span><span class=\"token punctuation\">}<\/span><\/span><span class=\"token string\"> \u4e2a\u76f8\u5173\u6587\u6863&#xff1a;&#034;<\/span><\/span><span class=\"token punctuation\">)<\/span><br \/>\n<span class=\"token keyword\">for<\/span> i<span class=\"token punctuation\">,<\/span> doc <span class=\"token keyword\">in<\/span> <span class=\"token builtin\">enumerate<\/span><span class=\"token punctuation\">(<\/span>docs<span class=\"token punctuation\">)<\/span><span class=\"token punctuation\">:<\/span><br \/>\n    <span class=\"token keyword\">print<\/span><span class=\"token punctuation\">(<\/span><span class=\"token string-interpolation\"><span class=\"token string\">f&#034;\\\\n\u6587\u6863 <\/span><span class=\"token interpolation\"><span class=\"token punctuation\">{<\/span>i<span class=\"token operator\">&#043;<\/span><span class=\"token number\">1<\/span><span class=\"token punctuation\">}<\/span><\/span><span class=\"token string\">:\\\\n<\/span><span class=\"token interpolation\"><span class=\"token punctuation\">{<\/span>doc<span class=\"token punctuation\">.<\/span>page_content<span class=\"token punctuation\">}<\/span><\/span><span class=\"token string\">&#034;<\/span><\/span><span class=\"token punctuation\">)<\/span><\/p>\n<p><img decoding=\"async\" src=\"https:\/\/www.wsisp.com\/helps\/wp-content\/uploads\/2026\/02\/20260224201901-699e07b5ebf10.jpg\" alt=\"\u4e09\u5c42\u9632\u5fa1\u5854\u63d2\u753b.jpg\" \/><\/p>\n<h3>\u4e94\u3001\u62d3\u5c55\u65b9\u6848&#xff1a;\u8d85\u8d8a\u57fa\u7840\u7684\u8bed\u4e49\u4fdd\u7559\u79d8\u7c4d<\/h3>\n<h4><font color=\"red\">\u62d3\u5c55\u65b9\u68481&#xff1a;\u591a\u6a21\u6001\u8bed\u4e49\u4fdd\u7559\u65b9\u6848<\/font><\/h4>\n<p>\u5728\u5b9e\u9645\u5e94\u7528\u4e2d&#xff0c;\u6587\u6863\u4e0d\u4ec5\u5305\u542b\u6587\u672c&#xff0c;\u8fd8\u5305\u542b\u56fe\u7247\u3001\u89c6\u9891\u7b49\u591a\u6a21\u6001\u5185\u5bb9\u3002\u4f20\u7edf\u7684\u6587\u672c\u5206\u5272\u65b9\u6cd5\u65e0\u6cd5\u5904\u7406\u8fd9\u4e9b\u975e\u6587\u672c\u5185\u5bb9&#xff0c;\u5bfc\u81f4\u591a\u6a21\u6001\u8bed\u4e49\u4e22\u5931\u3002<\/p>\n<p><font color=\"red\">\u89e3\u51b3\u65b9\u6848&#xff1a;<\/font><\/p>\n<li><font color=\"blue\">\u4f7f\u7528OCR\u6280\u672f\u63d0\u53d6\u56fe\u7247\u4e2d\u7684\u6587\u672c\u4fe1\u606f<\/font><\/li>\n<li><font color=\"green\">\u4f7f\u7528\u591a\u6a21\u6001\u6a21\u578b&#xff08;\u5982BLIP\u3001CLIP&#xff09;\u5c06\u56fe\u7247\u8f6c\u6362\u4e3a\u8bed\u4e49\u5411\u91cf<\/font><\/li>\n<li><font color=\"orange\">\u5c06\u6587\u672c\u548c\u56fe\u7247\u7684\u8bed\u4e49\u5411\u91cf\u5408\u5e76&#xff0c;\u6784\u5efa\u591a\u6a21\u6001\u5411\u91cf\u6570\u636e\u5e93<\/font><\/li>\n<li><font color=\"purple\">\u68c0\u7d22\u65f6\u540c\u65f6\u5339\u914d\u6587\u672c\u548c\u56fe\u7247\u7684\u8bed\u4e49\u5411\u91cf&#xff0c;\u5b9e\u73b0\u591a\u6a21\u6001\u8bed\u4e49\u4fdd\u7559<\/font><\/li>\n<h4><font color=\"blue\">\u62d3\u5c55\u65b9\u68482&#xff1a;\u52a8\u6001\u8bed\u4e49\u8c03\u6574\u65b9\u6848<\/font><\/h4>\n<p>\u4f20\u7edf\u7684\u6587\u672c\u5206\u5272\u662f\u9759\u6001\u7684&#xff0c;\u4e00\u65e6\u5206\u5272\u5b8c\u6210&#xff0c;\u540e\u7eed\u65e0\u6cd5\u6839\u636e\u5b9e\u65f6\u53cd\u9988\u8c03\u6574\u5207\u5206\u7b56\u7565\u3002\u5728\u5b9e\u9645\u5e94\u7528\u4e2d&#xff0c;\u7528\u6237\u7684\u9700\u6c42\u548c\u6587\u6863\u7684\u8bed\u4e49\u53ef\u80fd\u4f1a\u53d1\u751f\u53d8\u5316&#xff0c;\u9759\u6001\u5206\u5272\u65e0\u6cd5\u9002\u5e94\u8fd9\u4e9b\u53d8\u5316\u3002<\/p>\n<p><font color=\"red\">\u89e3\u51b3\u65b9\u6848&#xff1a;<\/font><\/p>\n<li><font color=\"blue\">\u5b9e\u65f6\u76d1\u63a7\u7528\u6237\u7684\u68c0\u7d22\u548c\u53cd\u9988\u6570\u636e<\/font><\/li>\n<li><font color=\"green\">\u4f7f\u7528\u5f3a\u5316\u5b66\u4e60\u6a21\u578b\u6839\u636e\u5b9e\u65f6\u53cd\u9988\u8c03\u6574\u5207\u5206\u7b56\u7565<\/font><\/li>\n<li><font color=\"orange\">\u52a8\u6001\u8c03\u6574chunk_size\u3001chunk_overlap\u548c\u5206\u9694\u7b26\u5217\u8868<\/font><\/li>\n<li><font color=\"purple\">\u5b9a\u671f\u91cd\u65b0\u5206\u5272\u6587\u6863&#xff0c;\u786e\u4fdd\u8bed\u4e49\u4fdd\u7559\u7b56\u7565\u59cb\u7ec8\u6700\u4f18<\/font><\/li>\n<h4><font color=\"green\">\u62d3\u5c55\u65b9\u68483&#xff1a;\u8de8\u6587\u6863\u8bed\u4e49\u5173\u8054\u65b9\u6848<\/font><\/h4>\n<p>\u5728\u5b9e\u9645\u5e94\u7528\u4e2d&#xff0c;\u7528\u6237\u7684\u95ee\u9898\u53ef\u80fd\u6d89\u53ca\u591a\u4e2a\u6587\u6863&#xff0c;\u4f20\u7edf\u7684\u5355\u6587\u6863\u5206\u5272\u65e0\u6cd5\u5904\u7406\u8de8\u6587\u6863\u7684\u8bed\u4e49\u5173\u8054\u3002<\/p>\n<p><font color=\"red\">\u89e3\u51b3\u65b9\u6848&#xff1a;<\/font><\/p>\n<li><font color=\"blue\">\u4f7f\u7528\u77e5\u8bc6\u56fe\u8c31\u6280\u672f\u6784\u5efa\u8de8\u6587\u6863\u7684\u8bed\u4e49\u5173\u8054\u7f51\u7edc<\/font><\/li>\n<li><font color=\"green\">\u5728\u5206\u5272\u65f6\u4fdd\u7559\u6587\u6863\u4e4b\u95f4\u7684\u5173\u8054\u4fe1\u606f<\/font><\/li>\n<li><font color=\"orange\">\u68c0\u7d22\u65f6\u540c\u65f6\u8003\u8651\u6587\u6863\u5185\u90e8\u548c\u6587\u6863\u4e4b\u95f4\u7684\u8bed\u4e49\u5173\u8054<\/font><\/li>\n<li><font color=\"purple\">\u751f\u6210\u56de\u7b54\u65f6\u6574\u5408\u591a\u4e2a\u6587\u6863\u7684\u8bed\u4e49\u4fe1\u606f&#xff0c;\u63d0\u4f9b\u66f4\u5168\u9762\u7684\u56de\u7b54<\/font><\/li>\n<h3>\u516d\u3001\u5e38\u89c1\u95ee\u9898\u4e0e\u89e3\u51b3\u65b9\u6848<\/h3>\n<h4>6.1 \u5982\u4f55\u5904\u7406\u65e0\u6362\u884c\u7684\u957f\u6587\u672c&#xff1f;<\/h4>\n<p><font color=\"red\">\u95ee\u9898\u63cf\u8ff0&#xff1a;<\/font> \u90e8\u5206\u6587\u6863&#xff08;\u5982OCR\u8bc6\u522b\u7684PDF\u3001\u722c\u866b\u83b7\u53d6\u7684\u6587\u672c&#xff09;\u65e0\u4efb\u4f55\u6362\u884c\u7b26&#xff0c;\u6587\u672c\u5bc6\u96c6\u6392\u5217\u3002\u76f4\u63a5\u5207\u5206\u4f1a\u5bfc\u81f4\u201c\u786c\u5207\u201d&#xff08;\u5982\u5728\u53e5\u5b50\u4e2d\u95f4\u62c6\u5206&#xff09;&#xff0c;\u8bed\u4e49\u65ad\u88c2\u4e25\u91cd&#xff0c;\u5f71\u54cd\u6a21\u578b\u7406\u89e3\u548cA\/B\u6d4b\u8bd5\u8bc4\u4f30\u3002<\/p>\n<p><font color=\"blue\">\u89e3\u51b3\u65b9\u6848&#xff1a;<\/font> \u5148\u901a\u8fc7\u6b63\u5219\u8868\u8fbe\u5f0f\u8bc6\u522b\u4e2d\u6587\u53e5\u672b\u6807\u70b9&#xff0c;\u8865\u5145\u5206\u6bb5\u5206\u9694\u7b26&#xff08;\u5982\u4e24\u4e2a\u6362\u884c&#xff09;&#xff0c;\u5c06\u957f\u6587\u672c\u62c6\u5206\u4e3a\u8bed\u4e49\u8fde\u8d2f\u7684\u6bb5\u843d&#xff1b;\u518d\u4f7f\u7528\u8bed\u4e49\u5206\u5757\u5668\u8fdb\u884c\u5207\u5206&#xff0c;\u907f\u514d\u786c\u5207\u3002<\/p>\n<h4>\u6838\u5fc3\u6b65\u9aa4&#xff1a;<\/h4>\n<li><font color=\"blue\">\u6b63\u5219\u8865\u5145\u5206\u9694\u7b26<\/font>&#xff1a;\u5728\u4e2d\u6587\u53e5\u672b\u6807\u70b9&#xff08;\u3002&#xff01;&#xff1f;&#xff0e;&#xff09;\u540e\u6dfb\u52a0\u4e24\u4e2a\u6362\u884c&#xff0c;\u6a21\u62df\u6bb5\u843d\u7ed3\u6784&#xff1b;<\/li>\n<li><font color=\"green\">\u9884\u5904\u7406\u4f18\u5316<\/font>&#xff1a;\u79fb\u9664\u591a\u4f59\u7a7a\u683c\u548c\u5236\u8868\u7b26&#xff0c;\u907f\u514d\u683c\u5f0f\u5e72\u6270&#xff1b;<\/li>\n<li><font color=\"orange\">\u8bed\u4e49\u5207\u5206<\/font>&#xff1a;\u4f7f\u7528SemanticChunker\u57fa\u4e8e\u8bed\u4e49\u76f8\u4f3c\u5ea6\u5207\u5206&#xff0c;\u786e\u4fdd\u7247\u6bb5\u8bed\u4e49\u5b8c\u6574\u3002<\/li>\n<h4>6.2 \u5982\u4f55\u5904\u7406\u591a\u8bed\u8a00\u6df7\u5408\u6587\u672c&#xff1f;<\/h4>\n<p><font color=\"red\">\u95ee\u9898\u63cf\u8ff0&#xff1a;<\/font> \u4f01\u4e1a\u4e1a\u52a1\u573a\u666f\u4e2d\u5e38\u51fa\u73b0\u4e2d\u82f1\u3001\u4e2d\u65e5\u7b49\u591a\u8bed\u8a00\u6df7\u5408\u6587\u672c&#xff08;\u5982\u6d89\u5916\u5408\u540c\u3001\u8de8\u5883\u4e1a\u52a1\u62a5\u544a\u3001\u53cc\u8bed\u4ea7\u54c1\u8bf4\u660e&#xff09;&#xff0c;\u6838\u5fc3\u75db\u70b9\u96c6\u4e2d\u5728\u4e09\u70b9&#xff1a;\u4e00\u662f\u5206\u9694\u7b26\u4e0d\u517c\u5bb9&#xff08;\u4e2d\u6587\u7528\u201c\u3002\u201d\u3001\u82f1\u6587\u7528\u201c.\u201d&#xff0c;\u5355\u4e00\u5206\u9694\u7b26\u5207\u5206\u6613\u9057\u6f0f\u6216\u8bef\u5207&#xff09;&#xff1b;\u4e8c\u662f\u8bed\u4e49\u6df7\u6742&#xff08;\u4e2d\u82f1\u6587\u53e5\u5b50\u4ea4\u7ec7&#xff0c;\u786c\u5207\u5206\u6613\u5bfc\u81f4\u201c\u4e2d\u82f1\u6587\u8bed\u4e49\u65ad\u88c2\u201d&#xff09;&#xff1b;\u4e09\u662f\u683c\u5f0f\u4e0d\u89c4\u6574&#xff08;\u591a\u8bed\u8a00\u6587\u672c\u5e38\u4f34\u968f\u7a7a\u683c\u3001\u6362\u884c\u6df7\u4e71&#xff0c;\u8fdb\u4e00\u6b65\u5e72\u6270\u5207\u5206\u7cbe\u5ea6&#xff09;\u3002\u8fd9\u4e9b\u95ee\u9898\u4f1a\u5bfc\u81f4\u6a21\u578b\u65e0\u6cd5\u5b8c\u6574\u7406\u89e3\u53cc\u8bed\u8bed\u4e49\u5173\u8054&#xff0c;\u8bc4\u4f30\u7ed3\u679c\u51fa\u73b0\u504f\u5dee\u3002<\/p>\n<p><font color=\"blue\">\u89e3\u51b3\u65b9\u6848&#xff1a;<\/font> \u6838\u5fc3\u903b\u8f91\u4e3a\u201c\u591a\u8bed\u8a00\u9002\u914d&#043;\u9884\u5904\u7406\u89c4\u6574&#043;\u7cbe\u7ec6\u5316\u5206\u5757\u201d\u3002<font color=\"blue\">\u7b2c\u4e00\u6b65&#xff0c;\u6784\u5efa\u878d\u5408\u591a\u8bed\u8a00\u5206\u9694\u7b26\u7684\u901a\u7528\u5206\u9694\u5217\u8868<\/font>&#xff0c;\u8986\u76d6\u4e2d\u82f1\u6587\u6807\u70b9\u3001\u6362\u884c\u7b49\u6838\u5fc3\u5206\u9694\u573a\u666f&#xff1b;<font color=\"green\">\u7b2c\u4e8c\u6b65&#xff0c;\u5bf9\u6df7\u5408\u6587\u672c\u8fdb\u884c\u9884\u5904\u7406&#xff08;\u6e05\u7406\u5197\u4f59\u7a7a\u683c\u3001\u7edf\u4e00\u683c\u5f0f&#xff09;<\/font>&#xff0c;\u51cf\u5c11\u683c\u5f0f\u5e72\u6270&#xff1b;<font color=\"orange\">\u7b2c\u4e09\u6b65&#xff0c;\u57fa\u4e8e\u6587\u672c\u8bed\u4e49\u5bc6\u5ea6\u9002\u914d\u5206\u5757\u53c2\u6570<\/font>&#xff0c;\u786e\u4fdd\u5207\u5206\u540e\u7247\u6bb5\u540c\u65f6\u4fdd\u7559\u4e2d\u82f1\u6587\u8bed\u4e49\u5173\u8054\u3002<\/p>\n<p><img decoding=\"async\" src=\"https:\/\/www.wsisp.com\/helps\/wp-content\/uploads\/2026\/02\/20260224201904-699e07b824e31.jpg\" alt=\"\u6587\u672c\u89e3\u7f20\u65b9\u6848.jpg\" \/><\/p>\n<h3>\u4e03\u3001\u4e92\u52a8\u73af\u8282&#xff1a;\u4f60\u7684\u8bed\u4e49\u4fdd\u7559\u6311\u6218&#xff1f;<\/h3>\n<h4>7.1 <font color=\"purple\">\u4e92\u52a8\u5f15\u5bfc<\/font><\/h4>\n<p>\u4f60\u5728\u6784\u5efaRAG\u7cfb\u7edf\u65f6\u9047\u5230\u8fc7\u54ea\u4e9b\u8bed\u4e49\u4e22\u5931\u7684\u95ee\u9898&#xff1f;\u4f60\u662f\u5982\u4f55\u89e3\u51b3\u7684&#xff1f;\u6b22\u8fce\u5728\u8bc4\u8bba\u533a\u5206\u4eab\u4f60\u7684\u7ecf\u9a8c\u548c\u6280\u5de7&#xff0c;\u8ba9\u6211\u4eec\u4e00\u8d77\u6210\u957f&#xff01;<\/p>\n<h4>7.2 <font color=\"green\">\u8f6c\u8f7d\u58f0\u660e<\/font><\/h4>\n<p>\u672c\u6587\u4e3a Java\u540e\u7aef\u7684Ai\u4e4b\u8def \u539f\u521b\u6587\u7ae0&#xff0c;\u5982\u9700\u8f6c\u8f7d&#xff0c;\u8bf7\u6ce8\u660e\u51fa\u5904<\/p>\n<hr \/>\n<p><font color=\"green\">\u5982\u679c\u4f60\u89c9\u5f97\u8fd9\u7bc7\u6587\u7ae0\u5bf9\u4f60\u6709\u5e2e\u52a9&#xff0c;\u8bf7\u70b9\u8d5e\u3001\u6536\u85cf\u3001\u8f6c\u53d1\u652f\u6301\u4e00\u4e0b&#xff01;<\/font><\/p>\n","protected":false},"excerpt":{"rendered":"<p>LangChain\u8bed\u4e49\u4fdd\u7559\u79d8\u7c4d&#xff1a;\u4ece\u539f\u7406\u5230\u5b9e\u6218\u7684\u7ec8\u6781\u6307\u5357 \u6587\u7ae0\u76ee\u5f55LangChain\u8bed\u4e49\u4fdd\u7559\u79d8\u7c4d&#xff1a;\u4ece\u539f\u7406\u5230\u5b9e\u6218\u7684\u7ec8\u6781\u6307\u5357\u4e00\u3001\u8bed\u4e49\u4e22\u5931&#xff1a;RAG\u7cfb\u7edf\u7684\u201c\u9690\u5f62\u6740\u624b\u201d1.1 \u4ec0\u4e48\u662f\u8bed\u4e49\u4e22\u5931&#xff1f;\u2014\u2014\u4e0d\u4ec5\u4ec5\u662f\u201c\u4fe1\u606f\u88ab\u5207\u65ad\u201d1.2 \u8bed\u4e49\u4e22\u5931\u7684\u6839\u6e90\u63a2\u7a76&#xff08;\u5168\u94fe\u8def\u62c6\u89e3&#xff09;\u5173\u952e\u8ba4\u77e5\u66f4\u65b0&#xff1a;\u4e8c\u3001LangChain TextSplitter\u6df1\u5ea6\u89e3\u6790\u4e0e\u6700\u4f73\u5b9e\u8df52.1 \u4e2d\u6587\u4f18\u5316\u7684RecursiveCharacterT<\/p>\n","protected":false},"author":2,"featured_media":77651,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[1],"tags":[8530,997,4119,50],"topic":[],"class_list":["post-77658","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-server","tag-8530","tag-langchain","tag-rag","tag-50"],"yoast_head":"<!-- This site is optimized with the Yoast SEO plugin v20.3 - https:\/\/yoast.com\/wordpress\/plugins\/seo\/ -->\n<title>\u8001\u677f\u8981\u7684RAG\u7cfb\u7edf\u603b\u4e22\u8bed\u4e49\uff0c\u9760LangChain\u56db\u5c42\u9632\u5fa1\uff0c\u518d\u4e5f\u4e0d\u7528\u80cc\u9505\uff01 - \u7f51\u7855\u4e92\u8054\u5e2e\u52a9\u4e2d\u5fc3<\/title>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/www.wsisp.com\/helps\/77658.html\" \/>\n<meta property=\"og:locale\" content=\"zh_CN\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"\u8001\u677f\u8981\u7684RAG\u7cfb\u7edf\u603b\u4e22\u8bed\u4e49\uff0c\u9760LangChain\u56db\u5c42\u9632\u5fa1\uff0c\u518d\u4e5f\u4e0d\u7528\u80cc\u9505\uff01 - \u7f51\u7855\u4e92\u8054\u5e2e\u52a9\u4e2d\u5fc3\" \/>\n<meta property=\"og:description\" content=\"LangChain\u8bed\u4e49\u4fdd\u7559\u79d8\u7c4d&#xff1a;\u4ece\u539f\u7406\u5230\u5b9e\u6218\u7684\u7ec8\u6781\u6307\u5357 \u6587\u7ae0\u76ee\u5f55LangChain\u8bed\u4e49\u4fdd\u7559\u79d8\u7c4d&#xff1a;\u4ece\u539f\u7406\u5230\u5b9e\u6218\u7684\u7ec8\u6781\u6307\u5357\u4e00\u3001\u8bed\u4e49\u4e22\u5931&#xff1a;RAG\u7cfb\u7edf\u7684\u201c\u9690\u5f62\u6740\u624b\u201d1.1 \u4ec0\u4e48\u662f\u8bed\u4e49\u4e22\u5931&#xff1f;\u2014\u2014\u4e0d\u4ec5\u4ec5\u662f\u201c\u4fe1\u606f\u88ab\u5207\u65ad\u201d1.2 \u8bed\u4e49\u4e22\u5931\u7684\u6839\u6e90\u63a2\u7a76&#xff08;\u5168\u94fe\u8def\u62c6\u89e3&#xff09;\u5173\u952e\u8ba4\u77e5\u66f4\u65b0&#xff1a;\u4e8c\u3001LangChain TextSplitter\u6df1\u5ea6\u89e3\u6790\u4e0e\u6700\u4f73\u5b9e\u8df52.1 \u4e2d\u6587\u4f18\u5316\u7684RecursiveCharacterT\" \/>\n<meta property=\"og:url\" content=\"https:\/\/www.wsisp.com\/helps\/77658.html\" \/>\n<meta property=\"og:site_name\" content=\"\u7f51\u7855\u4e92\u8054\u5e2e\u52a9\u4e2d\u5fc3\" \/>\n<meta property=\"article:published_time\" content=\"2026-02-24T20:19:07+00:00\" \/>\n<meta property=\"og:image\" content=\"https:\/\/www.wsisp.com\/helps\/wp-content\/uploads\/2026\/02\/20260224201853-699e07ade676f.gif\" \/>\n<meta name=\"author\" content=\"admin\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:label1\" content=\"\u4f5c\u8005\" \/>\n\t<meta name=\"twitter:data1\" content=\"admin\" \/>\n\t<meta name=\"twitter:label2\" content=\"\u9884\u8ba1\u9605\u8bfb\u65f6\u95f4\" \/>\n\t<meta name=\"twitter:data2\" content=\"5 \u5206\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\/\/schema.org\",\"@graph\":[{\"@type\":\"WebPage\",\"@id\":\"https:\/\/www.wsisp.com\/helps\/77658.html\",\"url\":\"https:\/\/www.wsisp.com\/helps\/77658.html\",\"name\":\"\u8001\u677f\u8981\u7684RAG\u7cfb\u7edf\u603b\u4e22\u8bed\u4e49\uff0c\u9760LangChain\u56db\u5c42\u9632\u5fa1\uff0c\u518d\u4e5f\u4e0d\u7528\u80cc\u9505\uff01 - \u7f51\u7855\u4e92\u8054\u5e2e\u52a9\u4e2d\u5fc3\",\"isPartOf\":{\"@id\":\"https:\/\/www.wsisp.com\/helps\/#website\"},\"datePublished\":\"2026-02-24T20:19:07+00:00\",\"dateModified\":\"2026-02-24T20:19:07+00:00\",\"author\":{\"@id\":\"https:\/\/www.wsisp.com\/helps\/#\/schema\/person\/358e386c577a3ab51c4493330a20ad41\"},\"breadcrumb\":{\"@id\":\"https:\/\/www.wsisp.com\/helps\/77658.html#breadcrumb\"},\"inLanguage\":\"zh-Hans\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\/\/www.wsisp.com\/helps\/77658.html\"]}]},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\/\/www.wsisp.com\/helps\/77658.html#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"\u9996\u9875\",\"item\":\"https:\/\/www.wsisp.com\/helps\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"\u8001\u677f\u8981\u7684RAG\u7cfb\u7edf\u603b\u4e22\u8bed\u4e49\uff0c\u9760LangChain\u56db\u5c42\u9632\u5fa1\uff0c\u518d\u4e5f\u4e0d\u7528\u80cc\u9505\uff01\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\/\/www.wsisp.com\/helps\/#website\",\"url\":\"https:\/\/www.wsisp.com\/helps\/\",\"name\":\"\u7f51\u7855\u4e92\u8054\u5e2e\u52a9\u4e2d\u5fc3\",\"description\":\"\u9999\u6e2f\u670d\u52a1\u5668_\u9999\u6e2f\u4e91\u670d\u52a1\u5668\u8d44\u8baf_\u670d\u52a1\u5668\u5e2e\u52a9\u6587\u6863_\u670d\u52a1\u5668\u6559\u7a0b\",\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\/\/www.wsisp.com\/helps\/?s={search_term_string}\"},\"query-input\":\"required name=search_term_string\"}],\"inLanguage\":\"zh-Hans\"},{\"@type\":\"Person\",\"@id\":\"https:\/\/www.wsisp.com\/helps\/#\/schema\/person\/358e386c577a3ab51c4493330a20ad41\",\"name\":\"admin\",\"image\":{\"@type\":\"ImageObject\",\"inLanguage\":\"zh-Hans\",\"@id\":\"https:\/\/www.wsisp.com\/helps\/#\/schema\/person\/image\/\",\"url\":\"https:\/\/gravatar.wp-china-yes.net\/avatar\/?s=96&d=mystery\",\"contentUrl\":\"https:\/\/gravatar.wp-china-yes.net\/avatar\/?s=96&d=mystery\",\"caption\":\"admin\"},\"sameAs\":[\"http:\/\/wp.wsisp.com\"],\"url\":\"https:\/\/www.wsisp.com\/helps\/author\/admin\"}]}<\/script>\n<!-- \/ Yoast SEO plugin. -->","yoast_head_json":{"title":"\u8001\u677f\u8981\u7684RAG\u7cfb\u7edf\u603b\u4e22\u8bed\u4e49\uff0c\u9760LangChain\u56db\u5c42\u9632\u5fa1\uff0c\u518d\u4e5f\u4e0d\u7528\u80cc\u9505\uff01 - \u7f51\u7855\u4e92\u8054\u5e2e\u52a9\u4e2d\u5fc3","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/www.wsisp.com\/helps\/77658.html","og_locale":"zh_CN","og_type":"article","og_title":"\u8001\u677f\u8981\u7684RAG\u7cfb\u7edf\u603b\u4e22\u8bed\u4e49\uff0c\u9760LangChain\u56db\u5c42\u9632\u5fa1\uff0c\u518d\u4e5f\u4e0d\u7528\u80cc\u9505\uff01 - \u7f51\u7855\u4e92\u8054\u5e2e\u52a9\u4e2d\u5fc3","og_description":"LangChain\u8bed\u4e49\u4fdd\u7559\u79d8\u7c4d&#xff1a;\u4ece\u539f\u7406\u5230\u5b9e\u6218\u7684\u7ec8\u6781\u6307\u5357 \u6587\u7ae0\u76ee\u5f55LangChain\u8bed\u4e49\u4fdd\u7559\u79d8\u7c4d&#xff1a;\u4ece\u539f\u7406\u5230\u5b9e\u6218\u7684\u7ec8\u6781\u6307\u5357\u4e00\u3001\u8bed\u4e49\u4e22\u5931&#xff1a;RAG\u7cfb\u7edf\u7684\u201c\u9690\u5f62\u6740\u624b\u201d1.1 \u4ec0\u4e48\u662f\u8bed\u4e49\u4e22\u5931&#xff1f;\u2014\u2014\u4e0d\u4ec5\u4ec5\u662f\u201c\u4fe1\u606f\u88ab\u5207\u65ad\u201d1.2 \u8bed\u4e49\u4e22\u5931\u7684\u6839\u6e90\u63a2\u7a76&#xff08;\u5168\u94fe\u8def\u62c6\u89e3&#xff09;\u5173\u952e\u8ba4\u77e5\u66f4\u65b0&#xff1a;\u4e8c\u3001LangChain TextSplitter\u6df1\u5ea6\u89e3\u6790\u4e0e\u6700\u4f73\u5b9e\u8df52.1 \u4e2d\u6587\u4f18\u5316\u7684RecursiveCharacterT","og_url":"https:\/\/www.wsisp.com\/helps\/77658.html","og_site_name":"\u7f51\u7855\u4e92\u8054\u5e2e\u52a9\u4e2d\u5fc3","article_published_time":"2026-02-24T20:19:07+00:00","og_image":[{"url":"https:\/\/www.wsisp.com\/helps\/wp-content\/uploads\/2026\/02\/20260224201853-699e07ade676f.gif"}],"author":"admin","twitter_card":"summary_large_image","twitter_misc":{"\u4f5c\u8005":"admin","\u9884\u8ba1\u9605\u8bfb\u65f6\u95f4":"5 \u5206"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"WebPage","@id":"https:\/\/www.wsisp.com\/helps\/77658.html","url":"https:\/\/www.wsisp.com\/helps\/77658.html","name":"\u8001\u677f\u8981\u7684RAG\u7cfb\u7edf\u603b\u4e22\u8bed\u4e49\uff0c\u9760LangChain\u56db\u5c42\u9632\u5fa1\uff0c\u518d\u4e5f\u4e0d\u7528\u80cc\u9505\uff01 - \u7f51\u7855\u4e92\u8054\u5e2e\u52a9\u4e2d\u5fc3","isPartOf":{"@id":"https:\/\/www.wsisp.com\/helps\/#website"},"datePublished":"2026-02-24T20:19:07+00:00","dateModified":"2026-02-24T20:19:07+00:00","author":{"@id":"https:\/\/www.wsisp.com\/helps\/#\/schema\/person\/358e386c577a3ab51c4493330a20ad41"},"breadcrumb":{"@id":"https:\/\/www.wsisp.com\/helps\/77658.html#breadcrumb"},"inLanguage":"zh-Hans","potentialAction":[{"@type":"ReadAction","target":["https:\/\/www.wsisp.com\/helps\/77658.html"]}]},{"@type":"BreadcrumbList","@id":"https:\/\/www.wsisp.com\/helps\/77658.html#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"\u9996\u9875","item":"https:\/\/www.wsisp.com\/helps"},{"@type":"ListItem","position":2,"name":"\u8001\u677f\u8981\u7684RAG\u7cfb\u7edf\u603b\u4e22\u8bed\u4e49\uff0c\u9760LangChain\u56db\u5c42\u9632\u5fa1\uff0c\u518d\u4e5f\u4e0d\u7528\u80cc\u9505\uff01"}]},{"@type":"WebSite","@id":"https:\/\/www.wsisp.com\/helps\/#website","url":"https:\/\/www.wsisp.com\/helps\/","name":"\u7f51\u7855\u4e92\u8054\u5e2e\u52a9\u4e2d\u5fc3","description":"\u9999\u6e2f\u670d\u52a1\u5668_\u9999\u6e2f\u4e91\u670d\u52a1\u5668\u8d44\u8baf_\u670d\u52a1\u5668\u5e2e\u52a9\u6587\u6863_\u670d\u52a1\u5668\u6559\u7a0b","potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/www.wsisp.com\/helps\/?s={search_term_string}"},"query-input":"required name=search_term_string"}],"inLanguage":"zh-Hans"},{"@type":"Person","@id":"https:\/\/www.wsisp.com\/helps\/#\/schema\/person\/358e386c577a3ab51c4493330a20ad41","name":"admin","image":{"@type":"ImageObject","inLanguage":"zh-Hans","@id":"https:\/\/www.wsisp.com\/helps\/#\/schema\/person\/image\/","url":"https:\/\/gravatar.wp-china-yes.net\/avatar\/?s=96&d=mystery","contentUrl":"https:\/\/gravatar.wp-china-yes.net\/avatar\/?s=96&d=mystery","caption":"admin"},"sameAs":["http:\/\/wp.wsisp.com"],"url":"https:\/\/www.wsisp.com\/helps\/author\/admin"}]}},"_links":{"self":[{"href":"https:\/\/www.wsisp.com\/helps\/wp-json\/wp\/v2\/posts\/77658","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.wsisp.com\/helps\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.wsisp.com\/helps\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.wsisp.com\/helps\/wp-json\/wp\/v2\/users\/2"}],"replies":[{"embeddable":true,"href":"https:\/\/www.wsisp.com\/helps\/wp-json\/wp\/v2\/comments?post=77658"}],"version-history":[{"count":0,"href":"https:\/\/www.wsisp.com\/helps\/wp-json\/wp\/v2\/posts\/77658\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/www.wsisp.com\/helps\/wp-json\/wp\/v2\/media\/77651"}],"wp:attachment":[{"href":"https:\/\/www.wsisp.com\/helps\/wp-json\/wp\/v2\/media?parent=77658"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.wsisp.com\/helps\/wp-json\/wp\/v2\/categories?post=77658"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.wsisp.com\/helps\/wp-json\/wp\/v2\/tags?post=77658"},{"taxonomy":"topic","embeddable":true,"href":"https:\/\/www.wsisp.com\/helps\/wp-json\/wp\/v2\/topic?post=77658"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}