
Scrapebox Question

Discussion in 'Black Hat SEO' started by datadyne, Aug 4, 2011.

  1. datadyne

    datadyne Junior Member

    Joined:
    Feb 12, 2011
    Messages:
    170
    Likes Received:
    78
    I know there is a guide out there, and I DID USE the search button but I couldn't figure one thing out....


    How do I extract all links from one domain?
    I have a list of 900 auto-approve domains and I want to get the rest of the URLs from those 900 domains.

    Thanks in advance guys!
     
  2. Roparadise

    Roparadise BANNED

    Joined:
    May 25, 2011
    Messages:
    786
    Likes Received:
    1,417
    You could use Xenu to get all the URLs from a domain; I'm not sure how it can be done in Scrapebox.
     
  3. SuperLinks

    SuperLinks Elite Member

    Joined:
    Jul 14, 2008
    Messages:
    2,903
    Likes Received:
    847
    Location:
    New York
    Xenu would be the best way, as long as you don't get blocked by the domain's "DDoS" protection.

    Unfortunately, getting all the URLs via Scrapebox requires using the Scrapebox search engine functionality, which can cause some headaches.

    Instead of doing a normal "scrape" session for keywords, use the following to find the internal pages.

    site:domain.com

    Make sure that you aren't using footprints with that, otherwise you won't find all the URLs of that site. This should be a straight "search" within Scrapebox with no footprints.
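    If you want to build that keyword list outside Scrapebox first, here is a rough Python sketch (domains.txt and keywords.txt are just placeholder file names, not anything Scrapebox expects):

        # Turn a list of auto-approve URLs/domains into "site:" queries,
        # one per unique domain, ready to paste into the keyword box.
        from urllib.parse import urlparse

        queries = []
        seen = set()
        with open("domains.txt") as src:
            for line in src:
                line = line.strip()
                if not line:
                    continue
                # Accept bare domains as well as full URLs.
                host = urlparse(line if "://" in line else "http://" + line).netloc.lower()
                if host.startswith("www."):
                    host = host[4:]
                if host and host not in seen:
                    seen.add(host)
                    queries.append("site:" + host)

        with open("keywords.txt", "w") as dst:
            dst.write("\n".join(queries))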
     
    • Thanks Thanks x 1
  4. Tenshisendo

    Tenshisendo Registered Member

    Joined:
    Nov 20, 2010
    Messages:
    64
    Likes Received:
    24

    You can do it this way to find all indexed links.

    Or you can go to addons and use the link extractor and set it to internal. This will find all links on the sites, indexed or not. I suggest running all your links through a couple of times to find all of them.

    Ex. Run your first pass, save the list, load that list back in, and run again, and so on.
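    For anyone curious what that internal pass boils down to outside Scrapebox, here is a rough standard-library sketch (two passes, no delays or robots handling, and example.com as a stand-in seed); it is only an approximation, not how the addon itself works:

        # Repeated internal link extraction: fetch each known page, collect
        # same-domain links, and feed the results back in for another pass.
        import urllib.request
        from html.parser import HTMLParser
        from urllib.parse import urljoin, urlparse

        class LinkParser(HTMLParser):
            def __init__(self):
                super().__init__()
                self.links = []

            def handle_starttag(self, tag, attrs):
                if tag == "a":
                    for name, value in attrs:
                        if name == "href" and value:
                            self.links.append(value)

        def internal_links(url):
            try:
                with urllib.request.urlopen(url, timeout=10) as resp:
                    html = resp.read().decode("utf-8", errors="ignore")
            except Exception:
                return set()
            parser = LinkParser()
            parser.feed(html)
            host = urlparse(url).netloc
            found = set()
            for link in parser.links:
                absolute = urljoin(url, link)
                if urlparse(absolute).netloc == host:
                    found.add(absolute.split("#")[0])
            return found

        # Two passes, as suggested above: pass 1 results get crawled again in pass 2.
        known = {"http://example.com/"}
        for _ in range(2):
            for url in list(known):
                known |= internal_links(url)
        print("\n".join(sorted(known)))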
     
    • Thanks Thanks x 1
  5. datadyne

    datadyne Junior Member

    Joined:
    Feb 12, 2011
    Messages:
    170
    Likes Received:
    78
    It didn't work for me... I got 0 results. Could you tell me what I did wrong?

    I picked custom footprint and put in site:domain.com
    And for my keywords I put in the 900 autoapprove domains, I got 0 results.
     
  6. dooogen

    dooogen Newbie

    Joined:
    Feb 11, 2010
    Messages:
    14
    Likes Received:
    0
    Cut all the auto-approve URLs down to just the domain. Remove duplicates.

    Put site: in front of all the domains and paste them into the keywords spot without any custom footprint.

    Then scrape and you will get all of the indexed pages for each domain. You can also add a comment footprint if you want only commentable pages.
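    If you do want the comment footprint, it can just be appended to each site: query. A minimal sketch, assuming the keywords.txt file from the sketch above and "leave a comment" as one example footprint:

        # Narrow the "site:" queries to likely-commentable pages by appending
        # a comment footprint ("leave a comment" is only one example).
        footprint = '"leave a comment"'
        with open("keywords.txt") as src, open("keywords_comments.txt", "w") as dst:
            for query in src:
                query = query.strip()
                if query:
                    dst.write(query + " " + footprint + "\n")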
     
  7. hellohellosharp

    hellohellosharp Power Member

    Joined:
    Dec 8, 2010
    Messages:
    625
    Likes Received:
    552
    Occupation:
    CEO @ CLEANFILES LLC
    Location:
    USA
    Home Page:
    Not to steal your thread, but I also have a Scrapebox question...

    Do you guys always trim to root on your harvests? When I harvest I usually end up with 5-6 posts per domain...

    I was thinking the best way to do it would be NOT trim to root on the first blast (get multiple posts from each domain) and THEN trim to root and remove duplicates for future use of the list. Is that right?
     
  8. Seo Lover

    Seo Lover Jr. Executive VIP Jr. VIP Premium Member

    Joined:
    Jan 30, 2011
    Messages:
    5,694
    Likes Received:
    4,117
    Gender:
    Male
    Occupation:
    Hanging Around Interwebs !
    Location:
    <-----------------Sin City
  9. Tenshisendo

    Tenshisendo Registered Member

    Joined:
    Nov 20, 2010
    Messages:
    64
    Likes Received:
    24
    What you need to do is trim all the sites to the root domain, then load them all in the keyword box.

    After that, make a txt file with nothing in it but "site:" without quotes.

    Then press the little M next to the footprint box and select the txt file you made.

    This will set up every domain in your keyword box as site:domain.com
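    For reference, the merge itself is easy to reproduce outside Scrapebox; footprints.txt and keywords.txt below are placeholder names, and the exact way Scrapebox joins footprint and keyword isn't verified here, only the site:domain.com result described above:

        # Combine every footprint line with every keyword, the way the M merge
        # is described above. Note: the exact join Scrapebox uses is assumed.
        with open("footprints.txt") as f:
            footprints = [line.strip() for line in f if line.strip()]
        with open("keywords.txt") as f:
            keywords = [line.strip() for line in f if line.strip()]

        merged = []
        for fp in footprints:
            for kw in keywords:
                # Operators ending in ":" take the keyword with no space.
                merged.append(fp + kw if fp.endswith(":") else fp + " " + kw)

        with open("merged_keywords.txt", "w") as out:
            out.write("\n".join(merged))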
     
  10. trevormorley

    trevormorley Junior Member

    Joined:
    Feb 25, 2010
    Messages:
    170
    Likes Received:
    46
    Is there benefit in getting a link from every page on a site rather than just one? Surely big G doesn't count it multiple times?
     
  11. typeslowly

    typeslowly Registered Member

    Joined:
    Nov 30, 2008
    Messages:
    61
    Likes Received:
    9
    Location:
    United States
    I organize by domain, rather than individual URLs. Makes it much easier.
     
  12. TheMatrix

    TheMatrix BANNED

    Joined:
    Dec 20, 2008
    Messages:
    3,444
    Likes Received:
    7,279
    That's the worst lmgtfy I've ever seen.


    OP: I use the site:domain.com footprint to scrape all G indexed pages.