Cannot post forms in Scrapebox

PandaBusters

Registered Member
Joined
Jun 10, 2017
Messages
71
Reaction score
6
I am trying to post to 31k forms in Scrapebox, but the error log shows the following errors:

11/12/2017 3:54:48 PM: HTTP: 403 Forbidden, SOCKET: , URL: http://www.houseofmagnets.com/football/index1.aspx?team=Raiders
11/12/2017 3:54:51 PM: HTTP: 200 OK, SOCKET: , URL: https://www.dickssportinggoods.com/c/nfl-fan-shop, Description: Redirect
11/12/2017 3:54:54 PM: HTTP: 500 , SOCKET: Connection timed out, URL: https://www.intelius.com/people/Derrick-Carr/Aurora-CO/0BN0SWQX0WA
11/12/2017 3:54:55 PM: HTTP: 503 Service Unavailable, SOCKET: , URL: http://www.truck-and-car-floor-mats.com/raider-truck-caps.html
11/12/2017 3:54:57 PM: HTTP: 999 Request denied, SOCKET: , URL: https://za.linkedin.com/in/derrick-harrison-9a272027
11/12/2017 3:54:58 PM: HTTP: 0 , SOCKET: Connection timed out, URL: http://www.thisismyindia.com/forum/forum/current-affairs
11/12/2017 3:55:12 PM: HTTP: 400 BAD_REQUEST, SOCKET: , URL: https://broom02.revolvy.com/topic/1973 Pittsburgh Steelers season
11/12/2017 3:55:20 PM: HTTP: 503 Service Unavailable, SOCKET: , URL: https://myaccount.raiders.com/

Poster status in error logs (like above, I am only showing a few url results):
11/12/2017 3:54:47 PM: ERROR: Platform not detected, URL: http://www.newslocker.com/en-us/sport/oakland-raiders/jack-del-rio-foundation-supports-mexico-city/, PROXY: 192.186.151.143:50409
11/12/2017 3:54:48 PM: ERROR: Platform not detected, URL: http://www.maximtickets.com/oakland-raiders-tickets.aspx, PROXY: 23.245.44.205:15606
11/12/2017 3:54:50 PM: ERROR: Unable to detect form/url, URL: https://thewoolenneedle.com/product/raider-red-2/, PROXY: 192.186.178.98:50409
11/12/2017 3:54:50 PM: ERROR: Platform not detected, URL: http://jerrygarcia.com/albums/?bid=3519, PROXY: 23.245.44.205:15606
11/12/2017 3:54:51 PM: ERROR: Platform not detected, URL: https://247sports.com/nfl/oakland-r...ke-trade-at-2017-NFL-trade-deadline-109763250, PROXY: 23.245.44.205:15606
11/12/2017 3:54:51 PM: ERROR: Platform not detected, URL: https://archive.org/details/mbid-52795e88-64ff-401d-87c1-1d172df8caa0, PROXY: 209.54.42.154:33418
11/12/2017 3:54:52 PM: ERROR: Platform not detected, URL: , PROXY: 23.245.44.205:15606
11/12/2017 3:54:52 PM: ERROR: Platform not detected, URL: http://www.tulsaworld.com/sportsextra/osusportsextra/award-winners/, PROXY: 196.196.163.192:28379
11/12/2017 3:54:53 PM: ERROR: Platform not detected, URL: https://bryanhallsawakening.wordpre...-puerto-rico-truth-catalonia-another-dead-dr/, PROXY: 209.54.42.154:33418
11/12/2017 3:54:54 PM: ERROR: Platform not detected, URL: https://www.amazon.com/Pop-Music-Oldies/b?ie=UTF8&node=284031, PROXY: 192.186.178.98:50409
11/12/2017 3:54:55 PM: ERROR: Platform not detected, URL: https://www.spokeo.com/Jack-Elliot/Famous-Songwriter, PROXY: 209.54.42.154:33418

What am I doing wrong? I have 10 private proxies and death by captcha. Thanks.
 

Sweetfunny

Jr. VIP
Premium Member
Joined
Jul 13, 2008
Messages
2,154
Reaction score
5,384
Website
www.scrapebox.com
Pretty much none of your URLs can be commented on: you have pages from LinkedIn, Archive.org, Amazon, DicksSportingGoods etc. Even the WordPress URL you have requires you to be logged in via Facebook/Google etc. So it seems like you just harvested random URLs. You need to use footprints to target the platforms that can accept posts. There are footprints built in; click the "Platforms" button on the harvester.

If you are just starting out, Loopline's videos are essential viewing: https://www.youtube.com/user/looplinescrapebox/videos
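As a rough illustration of the point about LinkedIn, Amazon and similar sites, a harvested list can be pre-filtered by host before posting. This is only a sketch in Python, not a Scrapebox feature; the blocklist and the function names are assumptions made up for this example, seeded with the sites named in this thread.

```python
from urllib.parse import urlsplit

# Hypothetical blocklist: sites named in this thread that cannot be commented on.
NON_POSTABLE = {"linkedin.com", "archive.org", "amazon.com", "dickssportinggoods.com"}

def is_non_postable(url, blocked=NON_POSTABLE):
    """Return True if the URL's host is a blocked domain or one of its subdomains."""
    host = (urlsplit(url).hostname or "").lower()
    return any(host == domain or host.endswith("." + domain) for domain in blocked)

def drop_non_postable(urls, blocked=NON_POSTABLE):
    """Keep only the URLs whose host is not on the blocklist."""
    return [url for url in urls if not is_non_postable(url, blocked)]

# URLs taken from the error log in the first post.
urls = [
    "https://za.linkedin.com/in/derrick-harrison-9a272027",
    "https://archive.org/details/mbid-52795e88-64ff-401d-87c1-1d172df8caa0",
    "http://www.thisismyindia.com/forum/forum/current-affairs",
]
print(drop_non_postable(urls))
```

A filter like this only removes obvious misses; it does not replace harvesting with proper footprints in the first place.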
 

satyr85

Jr. VIP
Joined
Aug 7, 2011
Messages
1,210
Reaction score
1,102
PandaBusters said: "What am I doing wrong?"
Looks like everything. Look for tutorials around the forum. Next time you are not able to post to some site, check by hand whether there is a posting form, a captcha, etc.
Btw, the logs give you the exact reason why SB was not able to post; if you don't understand the logs, you won't get anywhere.
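One way to read a long log is to count how often each failure reason appears. The following is a minimal sketch in Python that assumes the line layout shown in the first post (timestamp, then either an "HTTP: ..., SOCKET: ..." pair or an "ERROR: ..." message); the regular expression and the `tally` helper are made up for this example and are not part of Scrapebox.

```python
import re
from collections import Counter

# Assumed layout, based on the log lines quoted in this thread:
#   <date> <time> AM|PM: HTTP: <code> <text>, SOCKET: <text>, URL: ...
#   <date> <time> AM|PM: ERROR: <message>, URL: ..., PROXY: ...
LINE_RE = re.compile(
    r"^\S+ \S+ [AP]M: "
    r"(?:HTTP: (?P<status>\d+)[^,]*, SOCKET: (?P<socket>[^,]*),"
    r"|ERROR: (?P<error>[^,]+),)"
)

def tally(lines):
    """Count log lines by failure reason; lines that do not match are skipped."""
    counts = Counter()
    for line in lines:
        match = LINE_RE.match(line.strip())
        if not match:
            continue
        if match.group("error"):
            counts["ERROR: " + match.group("error").strip()] += 1
        else:
            key = "HTTP " + match.group("status")
            socket_msg = match.group("socket").strip()
            if socket_msg:
                key += " / " + socket_msg
            counts[key] += 1
    return counts

sample = [
    "11/12/2017 3:54:48 PM: HTTP: 403 Forbidden, SOCKET: , URL: http://www.houseofmagnets.com/football/index1.aspx?team=Raiders",
    "11/12/2017 3:54:58 PM: HTTP: 0 , SOCKET: Connection timed out, URL: http://www.thisismyindia.com/forum/forum/current-affairs",
    "11/12/2017 3:54:48 PM: ERROR: Platform not detected, URL: http://www.maximtickets.com/oakland-raiders-tickets.aspx, PROXY: 23.245.44.205:15606",
    "11/12/2017 3:54:50 PM: ERROR: Unable to detect form/url, URL: https://thewoolenneedle.com/product/raider-red-2/, PROXY: 192.186.178.98:50409",
    "11/12/2017 3:54:50 PM: ERROR: Platform not detected, URL: http://jerrygarcia.com/albums/?bid=3519, PROXY: 23.245.44.205:15606",
]
for reason, count in tally(sample).most_common():
    print(count, reason)
```

If "Platform not detected" dominates the tally, the harvested list is the problem rather than the proxies or the captcha service.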
 

PandaBusters

Thanks. I had trouble understanding what footprints are; the more common URLs probably have CSS/XML strings that require serious regex robot training to overcome.
 

satyr85

1. Find a page with the form you want to post to.
2. Look for something very unique on this page, for example "email"+"leave a comment"+"your name"; that's a single footprint.
3. Type this footprint into Google and check what results you get. Check all pages in the top 20-30 by hand to see whether every page, or most pages, have a posting form. If yes, the footprint is good.
4. Harvest target sites using footprints + a keyword list.
5. [Optional] Run the harvested list through GSA Platform Identifier or similar software. Here you would need to create a custom engine to filter sites the way you want.
6. Post
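
The hand check in step 3 can be roughly approximated on saved page HTML. The sketch below, in Python, uses only the standard-library HTML parser and a made-up heuristic: a form counts as a posting form if it contains a textarea plus at least one text or e-mail input. The class and function names are assumptions for this example, and the heuristic will not notice captchas, login walls, or forms built by JavaScript, so it does not replace looking at the page.

```python
from html.parser import HTMLParser

class FormFinder(HTMLParser):
    """Flags a page if any <form> holds a textarea and a text/e-mail input."""

    def __init__(self):
        super().__init__()
        self.in_form = False
        self.has_textarea = False
        self.has_text_input = False
        self.found = False

    def handle_starttag(self, tag, attrs):
        if tag == "form":
            # Start tracking a new form and reset what was seen so far.
            self.in_form = True
            self.has_textarea = False
            self.has_text_input = False
        elif self.in_form and tag == "textarea":
            self.has_textarea = True
        elif self.in_form and tag == "input":
            # An input without a type attribute defaults to a text field.
            kind = (dict(attrs).get("type") or "text").lower()
            if kind in ("text", "email"):
                self.has_text_input = True

    def handle_endtag(self, tag):
        if tag == "form" and self.in_form:
            if self.has_textarea and self.has_text_input:
                self.found = True
            self.in_form = False

def has_posting_form(html):
    """Heuristic check: does this HTML contain something like a comment form?"""
    finder = FormFinder()
    finder.feed(html)
    finder.close()
    return finder.found

comment_page = (
    '<form><input type="text" name="author"><input type="email" name="email">'
    '<textarea name="comment"></textarea></form>'
)
search_page = '<form><input type="text" name="q"></form>'
print(has_posting_form(comment_page), has_posting_form(search_page))
```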

Edit:
Harvesting and posting to forms with very heavy protection, or with custom fields, is the hardest part here.
To harvest you need tons of proxies. Private proxies won't help you a lot here; public proxies are the way to go. @proxygo is your guy if you need proxies for harvesting. One-time payment and your harvesting proxy needs are covered.
 

loopline

Jr. VIP
Joined
Jan 25, 2009
Messages
5,950
Reaction score
3,383
Website
contactformmarketing.com
PandaBusters said: "Thanks. I had trouble understanding what footprints are; the more common URLs probably have CSS/XML strings that require serious regex robot training to overcome."
A footprint is just whatever you put in the search box on google.com in a browser. Don't get hung up on the name "footprint".

cars

Is technically a footprint. Or you could say

cars "red"

is a footprint because you want red cars.

Also

cars "red" inurl:cars "powered by wordpress" -green -blue

is a footprint. So whatever you put in the search box in Google to get back what you want, just put that in the keywords box in Scrapebox and you get back the same. So basically

footprint = whatever you need to search for in the search engines to get back what you want.

It lets you target. As SweetFunny said above, it lets you dial in the results.
 

PandaBusters

If I'm inferring correctly, proper searching with the right search terms and modifiers means finding pages that share form field names, an engine/platform type, or unique keywords, and that are therefore more likely to accept your posting data (name, e-mail, subject, comment, etc.), so that with proper footprint habits conversion rates will be higher? I am still confused about how to use the footprint radio button in Scrapebox, and when to "push" it instead of the platform radio button. Thanks. Jim.
 

SEO

Jr. VIP
Joined
Jan 6, 2017
Messages
942
Reaction score
749
PandaBusters said: "If I'm inferring correctly, proper searching with the right search terms and modifiers means finding pages that share form field names, an engine/platform type, or unique keywords, and that are therefore more likely to accept your posting data (name, e-mail, subject, comment, etc.), so that with proper footprint habits conversion rates will be higher? I am still confused about how to use the footprint radio button in Scrapebox, and when to "push" it instead of the platform radio button. Thanks. Jim."
A lot of people use Scrapebox for more than just posting. I use it for tons of things like locating sources for quotes, or scraping all of the pages of a single domain. In those instances, you would use the "Custom Footprint" radio.

If I'm looking for blogs to comment on, or in your case, forms to submit using the "poster" feature, you need to choose a platform that Scrapebox understands. In this case, you would select the "Platform" radio, then click the "Platforms" button and choose all of the platforms you want Scrapebox to search for.

Loopline, who has responded to you on this thread, has a TON of tutorials all over YouTube for you to learn how to use the tool effectively. He has numerous "over-the-shoulder" videos that walk you through scraping, filtering and posting. Scrapebox does a lot, so keep using it and you'll find a ton of tasks you can get done faster because of it.
 

PandaBusters

Thanks, I'm watching the videos now. I am trying to get the conversion rate to 50%, like I had with GSA and Rankwyz; it is now at 1%.
 

PandaBusters

Still no success: the conversion rate is down to 0% after once reaching 25% (did I burn my private proxies? I started with only 10). I am using Death By Captcha, which I cannot test from within Scrapebox, but there are no reCAPTCHA rejections because most of these forms have no robot blockers (I did what you said and checked the top 30 results).

The first few keyword combinations I would use to harvest with (no public proxies) are:
allintext:"powered by wordpress" "name" "e-mail" "contact" "message" "comment" "Eigen vector Eigenvector"
allintext:"powered by wordpress" "name" "e-mail" "contact" "message" "comment" "seo black hat"
allintext:"powered by wordpress" "name" "e-mail" "contact" "message" "comment" "back link black hat"
allintext:"powered by wordpress" "name" "e-mail" "contact" "message" "comment" "cpa pay per click"
allintext:"powered by wordpress" "name" "e-mail" "contact" "message" "comment" "search engine cost per link"
allintext:"powered by wordpress" "name" "e-mail" "contact" "message" "comment" "Penguin Herbalife"
allintext:"powered by wordpress" "name" "e-mail" "contact" "message" "comment" "search engine black hat"
allintext:"powered by wordpress" "name" "e-mail" "contact" "message" "comment" "cost per link Melaleuca"
allintext:"powered by wordpress" "name" "e-mail" "contact" "message" "comment" "search engine marketing black hat"

The errors on the poster status page include:
11/14/2017 5:30:41 AM: HTTP: 404 Not Found, SOCKET: , URL: http://www.tengqiangyueqi.com/Shownews.asp?id=987
11/14/2017 5:30:42 AM: HTTP: 200 OK, SOCKET: Connection timed out, URL: http://www.ygsp.ro/comments.php?article_id=0825
11/14/2017 5:30:44 AM: HTTP: 503 Service Unavailable, SOCKET: , URL: http://mail.fuglen.com/birdlife/2013/sep/13/fuglen-monocle-guide-better-living/
11/14/2017 5:30:46 AM: HTTP: 200 OK, SOCKET: Connection timed out, URL: http://www.ygsp.ro/comments.php?article_id=0194
11/14/2017 5:30:49 AM: HTTP: 404 Not Found, SOCKET: , URL: https://rightforever.com/2017/03/28/trumps-new-air-force-one-boeing-rolled/
11/14/2017 5:30:49 AM: HTTP: 200 OK, SOCKET: Connection timed out, URL: http://www.mayuhana.com/blog/2008/08/post_11.html
11/14/2017 5:30:52 AM: HTTP: 0 , SOCKET: Connection timed out, URL: http://blog.sanriotown.com/abbyfries:hellokitty.com/2008/11/02/happy-birthday-hello-kitty/
11/14/2017 5:30:52 AM: HTTP: 200 OK, SOCKET: Connection timed out, URL: http://www.ygsp.ro/comments.php?article_id=0594
11/14/2017 5:30:54 AM: HTTP: 503 Service Unavailable, SOCKET: , URL: http://www2.dokidoki.ne.jp/piyoromu/keijibann/tnote.cgi?book=book4&from=1&to=28299
11/14/2017 5:30:57 AM: HTTP: 200 OK, SOCKET: Connection timed out, URL: http://www.levanteblu.it/guest/elenco.php
11/14/2017 5:30:59 AM: HTTP: 523 Origin Unreachable, SOCKET: , URL: https://archive.tsearch.eu/search.php?q= Build A Website
11/14/2017 5:31:03 AM: HTTP: 0 , SOCKET: Connection timed out, URL: http://synergytanningnorthyork.com/why-vitamin-d-is-good-for-you/
11/14/2017 5:31:03 AM: HTTP: 200 OK, SOCKET: , URL: http://chalakudi.groundtruth.in/reports/view/5

and for poster status errors (I used all guestbooks and all contact forms) I got:
11/14/2017 5:30:27 AM: ERROR: Platform not detected, URL: https://www.phpbb.com/downloads/, PROXY: 23.245.44.205:15606
11/14/2017 5:30:27 AM: ERROR: Platform not detected, URL: http://andikabea.blogspot.co.id/2015/06/, PROXY: 107.172.74.252:51108
11/14/2017 5:30:28 AM: ERROR: Platform not detected, URL: http://www.ericstips.com/tips/small-business-dependency/, PROXY: 192.186.151.143:50409
11/14/2017 5:30:28 AM: ERROR: Platform not detected, URL: http://www.sorethumbsblog.com/fe.html, PROXY: 209.54.42.154:33418
11/14/2017 5:30:29 AM: ERROR: Platform not detected, URL: http://stadtbranche.ch/thema-consulting, PROXY: 23.245.44.205:15606
11/14/2017 5:30:30 AM: ERROR: Platform not detected, URL: http://straseni.unimedia.info/news/...gina-web-oficiala.jpg-1924.html?mod=page&id=5, PROXY: 209.54.42.154:33418
11/14/2017 5:30:31 AM: ERROR: Platform not detected, URL: http://blog.twitt-erfolg.de/2017/06/die-neueste-traffic-technologie-in-5.html, PROXY: 107.172.74.252:51108
11/14/2017 5:30:32 AM: ERROR: Platform not detected, URL: http://www.kirjastuskunst.ee/catalo...t&rID=26657d5ff9020d2abefe558796b99584&reit=3, PROXY: 192.186.178.98:50409
11/14/2017 5:30:33 AM: ERROR: Platform not detected, URL: https://filter.pequeavalley.org/access/web/login?id=CRX9SC3FHJG15CM7KSRR1Y3PTTSO4MD7, PROXY: 192.186.151.143:50409
11/14/2017 5:30:33 AM: ERROR: Platform not detected, URL: https://filter.pequeavalley.org/access/web/login?id=DEV2KJRFWOSYQSZZWZTN68N5D7OFX7BT, PROXY: 192.186.178.98:50409
11/14/2017 5:30:33 AM: ERROR: Platform not detected, URL: https://filter.pequeavalley.org/access/web/login?id=QRKHXXOQE2AV17D8RGMJV3XX7HKJBZUY, PROXY: 107.172.74.252:51108
11/14/2017 5:30:37 AM: ERROR: Unable to detect form/url, URL: http://www.ncmissingpersons.org/2015/01/, PROXY: 206.223.232.163:1628
11/14/2017 5:30:39 AM: ERROR: Platform not detected, URL: http://pfaffkorea.co.kr/front/php/product.php?product_no=175&main_cate_no=24&display_group=1, PROXY: 107.172.74.252:51108
11/14/2017 5:30:39 AM: ERROR: Platform not detected, URL: http://thechive.com/category/girls/, PROXY: 78.157.223.174:52963
11/14/2017 5:30:40 AM: ERROR: Platform not detected, URL: https://www.oxwall.com/about, PROXY: 196.196.163.192:28379
11/14/2017 5:30:42 AM: ERROR: Platform not detected, URL: https://pastebin.com/V5q2H5um, PROXY: 196.196.163.192:28379
11/14/2017 5:30:43 AM: ERROR: Platform not detected, URL: http://www.homeimprovement.ga/electrician/electrician_nwe.php, PROXY: 78.157.223.174:52963
11/14/2017 5:30:45 AM: ERROR: Platform not detected, URL: http://arwaa77.mtjre.com/product-40783.html, PROXY: 206.223.232.163:1628
11/14/2017 5:30:45 AM: ERROR: Platform not detected, URL: https://myemma.com/login, PROXY: 192.186.151.143:50409
11/14/2017 5:30:46 AM: ERROR: Platform not detected, URL: https://www.budgetmobile.com/, PROXY: 206.223.232.163:1628
11/14/2017 5:30:55 AM: ERROR: Platform not detected, URL: https://filter.pequeavalley.org/access/web/login?id=W980EWFOHCIXJAX4MKWLCQ62EZGGM7L5, PROXY: 196.196.163.192:28379

The videos suggest 100 proxies, and I want to use four PCs to harvest and post on a 24/7 basis. I contacted the public proxy guy, @proxygo. He asked for my Skype and I gave it to him; now I have to wait to hear back. Thank you. Jim Wood.
 

loopline

PandaBusters said: "If I'm inferring correctly, proper searching with the right search terms and modifiers means finding pages that share form field names, an engine/platform type, or unique keywords, and that are therefore more likely to accept your posting data (name, e-mail, subject, comment, etc.), so that with proper footprint habits conversion rates will be higher? I am still confused about how to use the footprint radio button in Scrapebox, and when to "push" it instead of the platform radio button. Thanks. Jim."

So yes, you are correct about the concept of increasing success rates by scraping more targeted platforms that are likely to accept your data. The footprint button is for when you want to use the built-in footprints that Scrapebox offers.

In that footprints section, Scrapebox offers footprints for all the platforms it is capable of posting to. So I would start there: select the Platforms radio button, click the rectangular Platforms button, and select the platforms you want to post to. Then add your keywords, and Scrapebox will tack the required footprints for each platform onto your keywords.
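
To make the "tack on" idea concrete, here is a small Python sketch of pairing every footprint with every keyword. It is an illustration only, not Scrapebox's actual code or its built-in footprint list; the `merge` function and the sample footprints (borrowed from examples earlier in this thread) are assumptions.

```python
from itertools import product

def merge(footprints, keywords):
    """Pair every footprint with every keyword to form one search query each."""
    return [f'{footprint} "{keyword}"' for footprint, keyword in product(footprints, keywords)]

# Sample footprints taken from examples in this thread, not from Scrapebox itself.
footprints = ['"powered by wordpress"', '"leave a comment" "your name"']
keywords = ["red cars", "blue cars"]

for query in merge(footprints, keywords):
    print(query)
```

Two footprints and two keywords give four queries, so the number of searches grows as footprints times keywords, which is part of why harvesting at scale needs many proxies.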

Then post.
 