WsoVid - Dossier Search Tool
What is it?
MAN-DALA-WSO-VIDA-ET-WSO-VEDA All-Seer is All-Knower
WsoVid is a stand-alone tool written in C++ to perform fast searches on a local dossier of files or selected online resources. It is a search index and search server that works immediately and requires no configuration.
- Builds an index in memory. The index accepts keyword fragments as search terms.
- Can save index to disk and restore it later.
- Offers an ad-hoc search server via Telnet.
- Can be hooked up behind a web server via SCGI.
- Ranks content by the amount of human-readable content it sees.
- Clusters results into three categories: exact phrase matches, matches containing all keywords (AND), and any other appearance of a keyword.
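The three result categories listed above can be illustrated with a small sketch. This is not WsoVid's C++ implementation; the function names `classify` and `cluster` and the category labels are hypothetical, and plain case-insensitive substring matching stands in for the tool's fragment search.

```python
from typing import Dict, List, Optional


def classify(text: str, query: str) -> Optional[str]:
    """Return 'phrase', 'all', 'any', or None for a single document."""
    haystack = text.lower()
    fragments = query.lower().split()
    if not fragments:
        return None
    # Exact phrase: the whole query appears verbatim in the text.
    if " ".join(fragments) in haystack:
        return "phrase"
    # Substring tests, so keyword fragments such as "ind" match "index".
    hits = [fragment in haystack for fragment in fragments]
    if all(hits):
        return "all"   # every keyword fragment appears somewhere (AND)
    if any(hits):
        return "any"   # at least one keyword fragment appears
    return None


def cluster(docs: Dict[str, str], query: str) -> Dict[str, List[str]]:
    """Group document names by the category their text falls into."""
    clusters: Dict[str, List[str]] = {"phrase": [], "all": [], "any": []}
    for name, text in docs.items():
        category = classify(text, query)
        if category is not None:
            clusters[category].append(name)
    return clusters
```

Calling `cluster(docs, "search ind")` on a dictionary that maps file names to their text returns the names grouped under the three labels, with documents that match nothing left out.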
Get further information here
Why use it?
Researchers collect potentially interesting content that must be searched much later.
Because a dossier is pre-selected, it greatly increases the relevance of search results, but it is only of value if the response time is very short.
In a dossier of interest:
- Type of data is very heterogeneous
- Data is unorganized, or organized only through the file system
- Data is poorly labeled or not labeled at all
- Searches occur ad-hoc and must accept parts of keywords and combinations thereof
- Content is a mixture of local files, ripped online sources, and links to online pages
WsoVid is not meant to compete with Google Search, Mendeley, Yacy, grep or fzf. It is an additional tool in your searching toolbox.
Where to get it?
You can clone the fossil repository to get WsoVid.
fossil clone https://lodwich.net/fossil/wsovid
The license is MIT.
The tool is built using a bash script that works on Debian. It has also been built successfully on GhostBSD. It will not work on Windows.
- ./build.sh - will build a debug version
- ./build-release.sh - will build a production version
- ./tests.sh - will run the tests and, if LCOV is installed, also report coverage
What are the dependencies?
WsoVid depends on the following external tools to function correctly:
- curl
- pdf2txt
- djvutxt
- libreoffice
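Before building or running the tool, it can help to check that these programs are on the PATH. The sketch below is a hypothetical helper and not part of WsoVid. It assumes the executables are installed under exactly the names listed above, which may differ between distributions.

```python
import shutil
from typing import Dict, Iterable, List, Optional

# Names taken from the dependency list above; the actual executable
# names on a given system are an assumption.
REQUIRED_TOOLS = ("curl", "pdf2txt", "djvutxt", "libreoffice")


def find_tools(names: Iterable[str] = REQUIRED_TOOLS) -> Dict[str, Optional[str]]:
    """Map each tool name to its resolved path, or None if it is not on PATH."""
    return {name: shutil.which(name) for name in names}


def missing_tools(names: Iterable[str] = REQUIRED_TOOLS) -> List[str]:
    """Return the tool names that could not be found on PATH."""
    return [name for name, path in find_tools(names).items() if path is None]


if __name__ == "__main__":
    for name, path in find_tools().items():
        print(f"{name}: {path or 'NOT FOUND'}")
```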
Use-Case Overview
- interact via WsoVid's shell
- dump and re-load-merge index
- index local files
- crawl the web
- attach to web server via SCGI
- use telnet to query the index
- use the web browser to query the index
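For the SCGI use case, the web server forwards each request to the backend in the SCGI wire format: a netstring of NUL-terminated header name/value pairs, followed by the request body. The sketch below shows that framing only. The function name `build_scgi_request` is hypothetical, and the request URI and query parameter in the usage note are illustrative assumptions, not WsoVid's documented interface.

```python
from typing import Dict


def build_scgi_request(headers: Dict[str, str], body: bytes = b"") -> bytes:
    """Encode headers and body as a single SCGI request."""
    # The SCGI protocol requires CONTENT_LENGTH as the first header
    # and an "SCGI" header with the value "1".
    fields = [("CONTENT_LENGTH", str(len(body))), ("SCGI", "1")]
    fields += [
        (name, value)
        for name, value in headers.items()
        if name not in ("CONTENT_LENGTH", "SCGI")
    ]
    # Each header is name NUL value NUL.
    block = b"".join(
        name.encode("latin-1") + b"\x00" + value.encode("latin-1") + b"\x00"
        for name, value in fields
    )
    # Netstring framing: "<length>:<header block>," and then the body.
    return str(len(block)).encode("ascii") + b":" + block + b"," + body
```

A request for a hypothetical search URI could be built with `build_scgi_request({"REQUEST_METHOD": "GET", "REQUEST_URI": "/?q=dossier"})` and written to the socket the SCGI backend listens on.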
How to try?
Search this site with the WsoVid tool using its SCGI interface: