I know there are already objects supporting Office 2007 files, but is there any native Office 2003 or earlier support ?
Using Zend Lucene to search Office 2003 or older files
337 views Asked by Amadeus45 At
2
There are 2 answers
0
Brian
On
I would recommend indexing the documents with Solr and Tika together and using JSON to search your Solr/Lucene index from PHP. See the ExtractingRequestHandler (Solr wiki page) article for more information.
Related Questions in PHP
- How to add the dynamic new rows from my registration form in my database?
- Issue in payment form gateway
- How to create a facet for WP gridbuilder that displays both parent and child custom fields?
- Function in anonymous Laravel Blade component
- How to change woocomerce or full wordpress currency with value from USD to AUD
- General questions about creating a custom theme Moodle CMS
- How to add logging to an abstract class in php
- error 500 on IIS FastCGI but no clue despite multiple error loggings activated
- Composer installation fails and reverts ./composer.json and ./composer.lock to original content
- How to isolate PHP apps from each other on a local machine(Windows or Linux)?
- Laravel: Using belongsToMany relationship with MongoDB
- window.location.href redirects but is causing problems on the webpage
- Key provided is shorter than 256 bits, only 64 bits provided
- Laravel's whereBetween method not working with two timestamps
- Implementing UUID as primary key in Laravel intermediate table
Related Questions in ZEND-FRAMEWORK
- How to properly quote / escape this INSERT statement in Zend 1 Framework
- Zend Barcode label distance ajustment
- cast in doctrine query builder
- Laminas $filter->getValues() in getData() of laminas-form return duplicate of array collection when using Element File
- I want to translate country name to english in using diffrent country locale Magento2?
- How to get the name of the module, controller and action in Laminas framework?
- Zend XmlRpc Client gzinflate out of memory on php 7.1
- Zend Barcode image isn't generate correctly in CodeIgniter
- How to switch between the read-only and read-write db conenctions in Laminas using DBAL
- Database transaction is not working as expected even rollback is not working
- Ajax call in Zend 1.12 return 404
- Zend Framework 1 legacy application redesign
- Zend Barcode image do not generate correcly
- How to click the button after displaying the dialog box in jquery
- Zend Session Error, Session must be started before any output has been sent to browser
Related Questions in SOLR
- Upgrading to Solr 9 failes due to NoSuchFileException
- regex to produce duplicate string with modification
- Apache atlas UI not showing up
- SAP Commerce Cloud multisite SOLR configuration
- Solr 9 punctuation issue
- Accessing solr web interface behind reverse proxy returns "Content Encoding Error"
- Getting NPE in apache SOLR 8.11.2 while doing atomic update using add-distinct from my java based appication
- how to specify the maximum number of clusters for the STC algorithm in Solr admin console?
- SOLR compatibility of the KNN query parser with function queries
- How to use Solr as retriever in RAG
- Multiple replacement / substitute NGgram string SOLR 8.6
- Solr updates are taking too long. The update requests are stalling
- solrCloud(9.5) integrates springboots, and adds user authentication, and there is no problem with queries, but the new one keeps reporting errors
- Why does Spring Data for Apache Solr run a count query before running the actual query?
- SOLR 'facet.prefix' is not working as expected
Related Questions in LUCENE
- How to update Cassandra Lucene index with a new column? rebuild or update index?
- How to glue (merge) files Lucene?
- Apache Lucene performance estimation
- Lucene DocValues.Source deprecated
- Solr score diff in doc list and Explain score
- How do I reload the index before searching in Hibernate Lucene
- Using Lucene 9.10.0 MemoryIndex in Java to ingest and search IntField and use rangequery
- How can i use a builtin analyzer in my entity with Hibernate Search
- Atlas Search Index Build Fail
- how to use hiberanate search 7.1.0 analyzer settin in spring boot 3
- Suggester template Search issue ElasticSearch
- I'm using hibernate text based search and indexing. I want to search common rows between indexed tables using Lucene query
- Merging Solr index stored in HDFS not working
- Can't find document at lucene index with no delimeter in phrase
- How do I get the list of the full indexed terms in an ElasticSearch index?
Related Questions in SOLR-CELL
- solr /update/extract 404 Not Found
- Solr cell avoid metadata in fmap.content
- Solr Cell turn off metadata extraction
- Indexing a PDF document and providing additional JSON data using Solr Cell
- Soalrium PHP Extract query setFile()
- Does SOLR cell in any way limit the amount of characters imported into a solr.TextField?
- Solr 7.5 failing to index pdf files after upgrade from Solr 6.3
- Solr Cell fails to index image files with EXIF
- Importing files with solr cell/Tika metadata causes a multiple value error
- How does SOLR Cell add document content?
- Solr ExtractingRequestHandler giving empty content field
- Integrate Apache TIKA and Solr Cell with Solr to index pdf and word documents
- Solr: Perform stemming on a field and get the sorted list of stemmed words which were most frequent
- How to remove a lot of "\n" in text extracted from a Word file using Solr?
- Solrj ContentStreamUpdateRequest fails to save all literal fields unless they are dynamic
Popular Questions
- How do I undo the most recent local commits in Git?
- How can I remove a specific item from an array in JavaScript?
- How do I delete a Git branch locally and remotely?
- Find all files containing a specific text (string) on Linux?
- How do I revert a Git repository to a previous commit?
- How do I create an HTML button that acts like a link?
- How do I check out a remote Git branch?
- How do I force "git pull" to overwrite local files?
- How do I list all files of a directory?
- How to check whether a string contains a substring in JavaScript?
- How do I redirect to another webpage?
- How can I iterate over rows in a Pandas DataFrame?
- How do I convert a String to an int in Java?
- Does Python have a string 'contains' substring method?
- How do I check if a string contains a specific word?
Trending Questions
- UIImageView Frame Doesn't Reflect Constraints
- Is it possible to use adb commands to click on a view by finding its ID?
- How to create a new web character symbol recognizable by html/javascript?
- Why isn't my CSS3 animation smooth in Google Chrome (but very smooth on other browsers)?
- Heap Gives Page Fault
- Connect ffmpeg to Visual Studio 2008
- Both Object- and ValueAnimator jumps when Duration is set above API LvL 24
- How to avoid default initialization of objects in std::vector?
- second argument of the command line arguments in a format other than char** argv or char* argv[]
- How to improve efficiency of algorithm which generates next lexicographic permutation?
- Navigating to the another actvity app getting crash in android
- How to read the particular message format in android and store in sqlite database?
- Resetting inventory status after order is cancelled
- Efficiently compute powers of X in SSE/AVX
- Insert into an external database using ajax and php : POST 500 (Internal Server Error)
There doesn't seem to be anything bundled with
Zend_Search_Lucene, for those.Still, considering it can index HTML documents, if you can find a way to convert your Office 2003 documents to HTML (at least, for indexing -- keeping to original version alonside the HTML one, for consultation), you might be able to index those...