CLue - Command Line tool for Apache Lucene

Overview:

When working with Lucene, it is often useful to inspect an index.

Luke is awesome, but often times it is not feasible to inspect an index on a remote machine using a GUI. That's where Clue comes in. You can ssh into your production box and inspect your index using your favorite shell.

Another important feature for Clue is the ability to interact with other Unix commands via piping, e.g. grep, more etc.

License:

Clue is under the Apache Public License v2.0.

Bugs:

Please file bugs and feature requests here.

Downloads:

latest version: 0.0.4

What's new in this release?

Add ability to investigate indexes on HDFS
Add command to dump the index
Add command to import from a dumped index
Add configuration support, now you can configure Clue to run your own custom code
Add index trimming functionlity: sometimes you want a smaller index to work with
lucene 4.5.1 upgrade

source: release-0.0.4.zip

clue-all executable jar with all dependencies: clue-all-0.0.4.jar

clue jar with only clue class files, used as a library: clue-0.0.4.jar

distribution clue.tar.gz

Build:

mvn package

This will create 2 artifacts in the target directory:

clue-xxx.jar

jar file containing all clue classes.
clue-all-xxx.jar

executable jar file containing all clue classes as well as all runtime dependencies, e.g. java -jar clue-all-xxx.jar works

Run:

Interactive Mode:

./bin/clue.sh my-idx

Non-interactive Mode:

./bin/clue.sh my-idx command args

Command list:

./bin/clue.sh my-idx help

using configuration file found at: /Users/johnwang/github/clue/config/clue.conf
Analyzer: 		class org.apache.lucene.analysis.standard.StandardAnalyzer
Query Builder: 		class com.senseidb.clue.api.DefaultQueryBuilder
Directory Builder: 	class com.senseidb.clue.api.DefaultDirectoryBuilder
IndexReader Factory: 	class com.senseidb.clue.api.DefaultIndexReaderFactory
delete - deletes a list of documents from searching via a query, input: query
directory - prints directory information
docval - gets doc value for a given doc, <field> <docid>, if <docid> not specified, all docs are shown
exit - exits program
explain - shows score explanation of a doc
export - export index to readable text files
help - displays help
info - displays information about the index, <segment number> to get information on the segment
merge - force merges segments into given N segments, input: number of max segments
norm - displays norm values for a field for a list of documents
postings - iterating postings given a term, e.g. <fieldname:fieldvalue>
readonly - puts clue in readonly mode
reconstruct - reconstructs an indexed field for a document
search - executes a query against the index, input: <query string>
stored - displays stored data for a given field
terms - gets terms from the index, <field:term>, term can be a prefix
trim - trims the index, <TRIM PERCENTAGE> <OPTIONS>, options are: head, tail, random
tv - shows term vector of a field for a doc

Build a sample index to play with:

Clue bundles with some test data (15000 car data) for you to build a sample index to play with, do:

./bin/build_sample_index.sh my-idx

Examples:

Getting all the terms in the field 'color_indexed':

./bin/clue.sh my-idx terms color_indexed
Getting all the terms in the field 'color_indexed' starting with the term staring with 'r':

./bin/clue.sh my-idx terms color_indexed:r

./bin/clue.sh my-idx terms color_indexed | grep r
Do a search:

./bin/clue.sh my-idx search myquery
Get the index info:

./bin/clue.sh my-idx info
Iterate a posting for the term color_indexed:red

./bin/clue.sh my-idx postings color_indexed:red
List docvalues for the column-stride-field color:

./bin/clue.sh my-idx docval color
Get docvalue for the column-stride-field category for document 4:

./bin/clue.sh my-idx docval category 5
Get docvalue for the column-stride-field year of type numeric for document 3:

./bin/clue.sh my-idx docval year 3
Get docvalue for the column-stride-field json of type binary for document 3:

./bin/clue.sh my-idx docval json 3
Get docvalue for the column-stride-field tags of type sorted-set for document 3:

./bin/clue.sh my-idx docval tags 3

Name		Name	Last commit message	Last commit date
Latest commit History 85 Commits
bin		bin
config		config
src		src
CHANGES.txt		CHANGES.txt
LICENSE.TXT		LICENSE.TXT
README.md		README.md
pom.xml		pom.xml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

CLue - Command Line tool for Apache Lucene

Overview:

License:

Bugs:

Downloads:

What's new in this release?

Build:

Run:

Build a sample index to play with:

Examples:

About

Releases

Packages

Languages

License

ugoscaiella/clue

Folders and files

Latest commit

History

Repository files navigation

CLue - Command Line tool for Apache Lucene

Overview:

License:

Bugs:

Downloads:

What's new in this release?

Build:

Run:

Build a sample index to play with:

Examples:

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages