Xrumer Tips
Links database analysis – How to use it
by Xrumer_Pros on Jan.26, 2010, under Xrumer Tips
In this lesson it will be detailed described how correctly to use Links database analysis in 3 different cases:
1. Filtering adult-resources by hostname from mixed database.
2. Extracting only forums on phpBB
3. Database checking for “200 OK”- that mean extracting only working links.
There are just few examples of usage of this tool in different aims. Besides, this tool can extract only Russian resources (not using domain zones), or resources which use a specific engine/themes etc.
Example № 1
Filtering adult-resources by hostname from mixed database
Suppose, that in domain name of host contains adult keywords, then domain will have relation to adult content. This report is at 97% true, that mean almost all domains which contains adult keyword have adult content, and 3 % will not change much.
1) Choose adult keywords. For example, I chose only 5 keywords.
sex intim porn xxx erotic
2) Let set up the toll. For example, I took LinkList id22.txt that came free with Xrumer. Filtration will be by domain zone, and not by content – so number of treads is not so important.
In “Search:” field, enter your list of keywords. So it will look like:
3) Press “Run”. Process will take 2-3 seconds, because we use search by hostnames. So in few seconds we can see results.
As it can see the result is saved in new created database LinksList id22_mod.txt. Open this database – there are 773 links. If we have used more keywords then the result would be above.
... http://www.sexpacking.com/forum/read.php?2,358,page=6 http://www.asexuality.org/discussion/index.php http://sex-work.org/forums/index.php http://forum.literotica.com/sendmessage.php http://www.telefonsex2002.de/telefonsex-forum/index.php http://www.labanlieuesexprime.org/forum.php3?id_article=2 http://www.yusex.com/forum/index.php http://www.sexy-tipp.ch/forum/messages/21867/1481.html?1098903062 http://www.pod-porn.com/cgi-bin/distribb/ultimatebb.cgi http://bbs.porncity.net/index.php http://www.asexstories.com/community/index.php http://www.nofauxxx.com/boards/phpBB2/index.php http://phebus.journalintime.com/forum/ http://www.pornstarkings.com/index.php http://greatsexgames.com/forums/index.php http://www.worldsexguide.com/forum/index.php http://www.sexinfo.ro/forum/index.php ...
There are certainly small errors – in fact the word “express” after the word that finish with “s” will be as our keyword “sex”. These errors can be filtered by running the tool second time.
The task to collect adult database from mixed database is fulfilled less than in 5 minute.
Example №2
Extracting only forums on phpBB
1) Now will make a search by content and not by hostname. The process will be similar to first one with exception that keywords will be search in content of site and not in hostname. It will take some time to do search. Will use some keywords like:
phpBB viewforum.php viewtopic.php profile.php?mode=register
2) Let set up the toll. I took same database as in first example (LinkList id22.txt). In this case filtration will be by content, and not by hostname (domain zone). The number of treads will be 30 at 5Mb/sec connection. In “Search:” field enters your list of keywords. So it will look like:
3) Press “Run”. In few minutes will be checked more than 3000 URLs
At the end in new created database LinksList id2_mod.txt will be more than 11.000 forums on phpBB (from 25 000 links database):
Code
... http://AvtoSreda.RU/forum/index.php http://www.stroykann.ru/forum/index.php http://www.krada.org/forum/index.php http://forum.neoclub.ru/index.php http://forum.sch192.ru/index.php http://www.arbinada.com/modules.php?name=Forums http://forum.mashexport.com/index.php http://forum.kayman-k.ru/index.php http://fengshuiby.com/forum/index.php http://autoshina.kz/frm///index.php http://www.kachok.ru/forum/index.php http://www.evrostroika.ru/forum/index.php http://forum.spblove.ru/index.php http://mirabeltour.com/mirabelforum/index.php http://forum.americanfootball.ru/index.php http://www.f1-game.ru/forum/index.php http://cinema.kgd.info/forum/index.php http://forum.zapavto.ru/index.php http://forum.vinfo.ru/index.php ...
Example №3
Database checking for “200 OK”- that mean extracting only working links.
1) This example is analogical with search by content. For searching for “200 OK” it is not necessary to download full page, it is enough to download only topic. At begin of this topic should be “200 OK”. If in topic will be “404 NOT FOUND” or “403 FORBIDDEN”, then this link mismatches with our search. So in “Search:” field should use only one line:
“200 OK”
2) Use same settings as in last example, with exception don’t forget to enable option “Check only header in content”, number of treads -50. It will be used LinksList id30.txt database. So it will look like:
3) Press “Run”. Process is faster, than in previous example (40treads/sec, instead of 12 treads/sec as was in previous example), because search is made only in topic of content (since “200 OK” is only in topic):
In fact, in resulting database (LinksList id30_mod.txt) are saved almost all links, because most of them are working (from 1357 links working links are 1256 links). All links where appear “404 Not Found” or where host is banned – are filtered
As you could see this tool can be used in many different ways. Success in Your experiments!
Usage of Sorting by PR tool in Hrefer
by Xrumer_Pros on Jan.26, 2010, under Xrumer Tips
Unfortunately, some users of Hrefer have problems with Sorting by PR tool, because of incorrect usage. In this lesson it will be detailed described how correctly to use this tool.
Phase 1: Preparing database, set-up
1) If database is not in subfolder Links , then it should be copied there. It is not necessary but it is recommended. For example my database is named New_test.txt
2) Start Hrefer. Enter in menu “Tools > Sort current links database by PR”. It is not necessary to update proxies.
3) Use default settings. At 5 Mb you can use more treads as usually. For example I use 50 treads.
4) Method of sorting is “Standard”
It should be like that:
Attention:
Path to database is not entered. It is indicated only name of database.
Phase 2 : Start sorting
1) Press “Run” button.
It will begin with first part of process. Hrefer will determine PageRank to all links from database, but sorting by PR it is not started yet. All PR values are stored in TEMPORARY file PRList.txt. (Attention: this txt file is temporary one and not with final results.) If you have made PR sorting earlier and you have saved old PRList.txt, you must wait till all saved data will be downloaded. PRList.txt work as basic file.
Process of this phase looks like this:
2) At the moment when progress bar will end it mean that Hrefer have determined PR of all links from database and it is ready to make sorting by PR. It will appear this window messages:
If you want to make sorting by PR then you should press “YES”. Only after this sorting by PR will begin. In my case New_test.txt database will be sorted. Future sorting will be made in some seconds.
Phase 3: Checking
1) Close Hrefer.
2) Open Links folder.
3) Open the database , but not PRList.txt
Evidently that at the beginning of database there are URLs with higher PR.
http://web.mit.edu/bjblair/www/wedding/cgi-bin/guestbook/guestbook.cgi http://www.slac.stanford.edu/econf/editors/submit.html http://www.whitehouse.gov/stateoftheunion/2002/guest.html http://www.duke.edu/web/saturdaynight/guestbook.html http://www.indiana.edu/~fsg/guestbook/addguest.html http://www.ucsd.edu/help/addlink.html http://pages.ebay.com/motors/finance/addlink.html http://www.quirksmode.org/dom/tests/add.html http://www.umich.edu/~fceiaa/add.html
at the end of database will be with lowest PR:
http://www.adult4us.com/cgi-bin/add.cgi http://www.worldwidehelp.org/petitionen/villamartin/eng/eintrag.html http://www.dentarg-alliance-gold.cn/addlink.php http://nordlichter.mausethal.de/gaestebuch.php http://www.urlaub-in-k%C3%BChlungsborn.de/Gaestebuch/Gaestebuch.php?action=impressum http://radioheinz.de/gaestebuch.php http://www.ge2004.it/default.asp?id=376&lingua=ENG http://m20.kosin.org/ttboard/ttboard.cgi?act=list&bname=SY_NOTICE
It was just an example but you can use it if you’ll have some difficulties to sort by PR.
Method of sorting could be used different as I gave you in this example. It is possible to separate database in 10 different files in dependence of PR by using “Multisort” method and not “Standard”.
During sorting by PR it is not necessary to use proxies. Hrefer can determine well PR without any proxies, but in some cases using proxies during sorting can be faster.
Short FAQ about this lesson
Question: “How is better to sort URLs by hostname or using full link?”
Answer : Using full link at sorting is not every time well, because your post can be in different place. For example: Full link to form have PR=0, but the page with posts have PR=5.
How to get Targeted Link Lists
by Xrumer_Pros on Jan.26, 2010, under Xrumer Tips
As an example, our goal is to get a super TARGETED list for “POKER”:
a) English blogs
b) English forums
So what are you doing?
1) Add words from google -> Bullshit, it will add every kinds of word, but probably nothing related to POKER
2) Add word from text file -> Bullshit, too. You will get a crappy list of all kind of topics.
So what are the BEST PRACTISES:
A) Go to the GOOGLE KEYWORD TOOL (https://adwords.google.de/select/KeywordToolExternal) and get word suggestions for POKER. ADD *ALL* keywords to your list of keywords (button “Add All”). Also add longtails like 4-5 word combinations which do not have any traffic at all. You can add here probably 100 keywords.
B) Look at your keywords list which you see in Google Keyword Tools. Which are OTHER good keywords ? Another good keyword would be for example “holdem”. So paste “holdem” into the Keyword Tool External and get another +~100 keyword suggestions. (you should have gotten ~200 keywords now)
C) Repeat B) untill you have got about 5000 longtail keywords in your list. Then download this list. (keywords.txt)
D) Use this keywords.txt list DIRECTLY as WORDLIST for HREFER. Each line contains MORE than 1 word, but this is perfectly ok for HREFER.
E) Choose Google and/or Google.Blogsearch (and other) and start harvesting.
From your 5000 Poker longtail keywordlist you should be able to get at least 20-30k highly targeted blogs for BLOG COMMENTING (the same is true if you look for forums).
You will see that about 50% of your harvested links are HIGHLY (!) targeted.
Xrumer Tips & Tricks
by Xrumer_Pros on Jan.25, 2010, under Xrumer Tips