你的浏览器禁用了JavaScript, 请开启后刷新浏览器获得更好的体验!

如何用Solr 3.1配置Tika 0.9

| 你能给我一些用Solr 3.1配置Tika 0.9的步骤吗

<requestHandler name=\"/update/extract\" 
                  startup=\"lazy\"
                  class=\"solr.extraction.ExtractingRequestHandler\" >
    <lst name=\"defaults\">
      <!-- All the main content goes into \"text\"... if you need to return
           the extracted text or do highlighting, use a stored field. -->
      <str name=\"fmap.content\">text</str>
      <str name=\"lowernames\">true</str>
      <str name=\"uprefix\">ignored_</str>

      <!-- capture link hrefs but ignore div attributes -->
      <str name=\"captureAttr\">true</str>
      <str name=\"fmap.a\">links</str>
      <str name=\"fmap.div\">ignored_</str>
    </lst>
  </requestHandler>

我在solrconfig.xml中使用此配置请帮我谢谢，

已邀请:

1 个回复

递劝臼类洪

假设您在solr中安装了Tika（和依赖项），那么这应该就是您所需要做的。您是否已阅读ExtractingRequestHandler Wiki页面？它具有很多信息，并且方便地还有一些使用curl的配方，可以让您测试它们是否正常工作。

要回复问题请先登录或注册

如何用Solr 3.1配置Tika 0.9

1 个回复

发起人

apache_tika

solr_cell

solr

问题状态

如何用Solr 3.1配置Tika 0.9

与内容相关的链接

1 个回复

发起人

apache_tika

solr_cell

solr

问题状态