Settings & History
  • You can use boolean operators (-, |, OR), wildcards (*, ?), and phrase search (") in your query
  • For BitTorrent: Paste in a 40 characters info_hash, to search for that particular torrent and get all trackers for it
Browse|Add this search and API to your site



<< Post  Cute movie   ::   New site added to the isohunt.com family  Post >>

Author Message
IH

Admin, Dev, Janitor


Joined: 21 Jan 2003
Posts: 3620
Location: 127.0.0.1

Status: Offline
Reputation: 3310

Post Posted: Sat Sep 17, 2005 1:58 pm Reply with quote   Back to top    

Firstly, there have been quite a lot of downtime lately. Our database suffered disk failures (twice), and some new software broke. Everything should be back to normal now (faster actually, with some code optimization).

After a little break from major coding besides continual small tweaks here and there, I've done a fair bit of code changes. A lot of minor fixes, and here are some notable fixes and new features:

- New default torrent search results sorting (REL), using combination of search terms relevance, seeds and leechers, so you get the most relevant and popular torrents at the top of the lists
- Improved query parsing for more accurate search terms matching (more below in comments)
- New tracker scraping algorithms, so you'll get more updated and accurate seeds/leechers stats in torrent search results
- More convenient single column search results sorting
- Fixed "undefined" links in search results and release pages
- RSS feeds now return more results and in more timely manner, for all searches as before

Together with more relevant search results and updated stats, our search index have also expanded to new record of over 4.5 million files (over 150 TB worth) contained in over 240,000 torrents, indexed from all major BitTorrent sites across the internet. This represents the largest repository of BitTorrent resources available anywhere, with the most advanced interface (sort on any date, seeds, etc. columns in web search, and RSS feeds on the latest in all searches).

With these foundation work out of the way, I'll be starting work on brand new features for isoHunt v3. Be prepared to be surprised Wink


UPDATE: Fixed bug with phrase search ("double quotes") not returning results.

_________________
"He is no fool who gives up what he cannot keep to gain what he cannot lose." - Jim Elliot
"Science without religion is lame: Religion without science is blind." - Albert Einstein
"The best way to predict the future is to invent it." - Alan Kay

Last edited by IH on Mon Sep 19, 2005 2:09 pm; edited 3 times in total
View user's profile  Send private message            
IH

Admin, Dev, Janitor


Joined: 21 Jan 2003
Posts: 3620
Location: 127.0.0.1

Status: Offline
Reputation: 3310

Post Posted: Sat Sep 17, 2005 2:21 pm Reply with quote   Back to top    

Below are boolean operators for advanced search. By default, you can just type in search words without these and isoHunt.com will return you pretty good results (more on how isoHunt tweaks your query below)

mysql.com wrote:
Search Operators

+
A leading plus sign indicates that this word must be present in every row returned.

-
A leading minus sign indicates that this word must not be present in any row returned.

(no operator)
By default (when neither + nor - is specified) the word is optional, but the rows that contain it will be rated higher.

*
An asterisk is the truncation operator. Unlike the other operators, it should be appended to the word.

"
A phrase that is enclosed within double quote (`"') characters matches only rows that contain the phrase literally, as it was typed.

> <
These two operators are used to change a word's contribution to the relevance value that is assigned to a row. The > operator increases the contribution and the < operator decreases it. See the example below.

( )
Parentheses are used to group words into subexpressions. Parenthesized groups can be nested.

~
A leading tilde acts as a negation operator, causing the word's contribution to the row relevance to be negative. It's useful for marking noise words. A row that contains such a word will be rated lower than others, but will not be excluded altogether, as it would be with the - operator.


The following examples demonstrate some search strings that use boolean full-text operators:

apple banana
Find rows that contain at least one of the two words.

+apple +juice
Find rows that contain both words.

+apple macintosh
Find rows that contain the word ``apple'', but rank rows higher if they also contain ``macintosh''.

+apple -macintosh
Find rows that contain the word ``apple'' but not ``macintosh''.

apple*
Find rows that contain words such as ``apple'', ``apples'', ``applesauce'', or ``applet''.

"some words"
Find rows that contain the exact phrase ``some words'' (for example, rows that contain ``some words of wisdom'' but not ``some noise words''). Note that the `"' characters that surround the phrase are operator characters that delimit the phrase. They are not the quotes that surround the search string itself.

+apple +(>turnover <strudel)
Find rows that contain the words ``apple'' and ``turnover'', or ``apple'' and ``strudel'' (in any order), but rank ``apple turnover'' higher than ``apple strudel''.


With so many search items in our index, we've found it necessary that in most cases, an AND search of all terms will return a narrower but far more accurate results. However, an AND search on too many terms usually yield 0 results. Therefore as a compromise, our new query parser now optimizes your query in the following way:

1) The first 3 words without an operator are appended with a + sign (AND search), which you can override by appending the | operator in front of search terms
2) Subsequent words are left alone as optional keywords

For example, search on isoHunt for:

one two three four five six seven
would be modifed to actually
+one +two +three four five six seven

|one |two |three four five six seven
would be modifed to actually
one two three +four +five +six seven

|one |two |three four +five six seven
would be modifed to actually
one two three +four +five +six +seven


So for normal searching using long queries without boolean operators, just search with more important words at the beginning. Also note that any search word less than 2 characters in length will be ignored in the search. Have fun power searching!

_________________
"He is no fool who gives up what he cannot keep to gain what he cannot lose." - Jim Elliot
"Science without religion is lame: Religion without science is blind." - Albert Einstein
"The best way to predict the future is to invent it." - Alan Kay
View user's profile  Send private message            
IH

Admin, Dev, Janitor


Joined: 21 Jan 2003
Posts: 3620
Location: 127.0.0.1

Status: Offline
Reputation: 3310

Post Posted: Tue Mar 14, 2006 12:17 am Reply with quote   Back to top    

After the major upgrade, searches now have all terms considered (complete AND search by default). You can still override it by adding "|" in front of terms for making it optional.

These are boolean operators that can be used:

+ (already assumed), -, |, "", ()
(other boolean operators are now deprecated)

ex. Search for (|word1 |word2) searches for EITHER of the 2 words, when the default behaviour without any boolean operators would be (+word1 +word2), both words required in the results.
(word1 || word2) and (word1 OR word2) are boolean word variants, and are the same as (|word1 |word2).

Wildcard operators:
*
ex. Search for (gent*) would search for "gentle", "gentoo", and other terms that start with gent.
?
ex. Search for (te?t) would search for "test", "text", etc. Or terms that replaces ? with one character.


Also, note the following list of stopwords, which are words too common and are not used in searching nor are included in the index:
Code:
"a", "an", "and", "are", "as", "at", "be", "but", "by",
"for", "if", "in", "into", "is", "it",
"no", "not", "of", "on", "or", "s", "such",
"t", "that", "the", "their", "then", "there", "these",
"they", "this", "to", "was", "will", "with",
"from",
"files",
"directory",           
"nbsp",                                       
"com",                                           
"www",
"file",   
"http", 
"visit"


These words are however INCLUDED within phrase searches.

_________________
"He is no fool who gives up what he cannot keep to gain what he cannot lose." - Jim Elliot
"Science without religion is lame: Religion without science is blind." - Albert Einstein
"The best way to predict the future is to invent it." - Alan Kay
View user's profile  Send private message            
Display posts from previous:       

<< Post  Cute movie   ::   New site added to the isohunt.com family  Post >>

You cannot post new topics in this forum
You cannot reply to topics in this forum
You cannot edit your posts in this forum
You cannot delete your posts in this forum
You cannot vote in polls in this forum

This site features search engines on metadata only. It is a service independent of the IRC and BitTorrent networks. Use at your own risk.


Powered by phpBB :: All times are GMT - 7 Hours



-ADVERTISEMENT-
BTGuard - Download Anonymously

VIP

membership is only $1/month, with perks like turning off all annoying ads. We dislike ads as much as you do!

Our 2nd contest winning t-shirt design! You know you want it!
NEW 8 Years Anniversary Tee!



Random Poll
Should we create a new youtube?
Yes
No
Torrent Every video on the site

New Posts

Friends
TorrentBox
Podtropolis

TorrentFreak
Torrents.to

FAC, CMCC
Defend Fair Use
Neutrality.ca

This site features search engines on metadata only. It is a service independent of the IRC and BitTorrent networks. Use at your own risk.
Canadian Coalition for Electronic Rights - CCER.CA   Lighttpd   Get Firefox   FF Plugins, Toolbar & Widgets

Page generation: 1.97s (1% in 10 SQLs) on b05, loadavg: 1.39       © isoHunt Inc. | Privacy & Copyright Policies