1. This site uses cookies. By continuing to use this site, you are agreeing to our use of cookies. Learn More.

Exclude domains from an email list

Discussion in 'Black Hat SEO Tools' started by Roger1979, May 8, 2015.

  1. Roger1979

    Roger1979 Newbie

    Joined:
    Apr 4, 2015
    Messages:
    12
    Likes Received:
    1
    Hello fellows,
    I have a big email list (text) and another text file containing domains I want to exclude, in the following format:
    domain1
    domain2

    Anyone knows what software I can use to parse the file containing the emails and exclude all the lines containing the domains from the second file? There are about 1000 domains I want to exclude, so I need the name of a decent software (not maxprog's apps pls).
    Thank you all in advance.
     
  2. qrazy

    qrazy Senior Member

    Joined:
    Mar 19, 2012
    Messages:
    1,115
    Likes Received:
    1,725
    Location:
    Banana Republic
    If the file isn't large, just use this to load the file and search using regular expression like domain1.com|domain2.abc|domain3.xxx ...

    If the file is too large, you could use the regular expression with editors like editplus or notepad++ to select the lines containing your exclude words and remove them.

    If you know programming it shouldn't take more than 5 mins to do this in any language.
     
  3. Roger1979

    Roger1979 Newbie

    Joined:
    Apr 4, 2015
    Messages:
    12
    Likes Received:
    1
    qrazy, thank you. The file/s are big ...I don't know programming, but I'll give Notepad++ a shot. Hopefully I can work it out. Thanks.

    Edit: Ops, I guess I need to learn some programming, because I don't see where I can add 1000 filters using Notepad++, I know how to add a filter, menu, mark, bookmark...etc, but adding 1000 seems impossible. Well, I guess I'm not thinking outside the box.
     
    Last edited: May 8, 2015
  4. Roger1979

    Roger1979 Newbie

    Joined:
    Apr 4, 2015
    Messages:
    12
    Likes Received:
    1
    I wish ListMotor would be cheaper...:headache:
     
  5. Sweetfunny

    Sweetfunny Jr. VIP Jr. VIP

    Joined:
    Jul 13, 2008
    Messages:
    1,793
    Likes Received:
    5,073
    Location:
    ScrapeBox v2.0
    Home Page:
    When i saw this i though i wonder if the Manage URL Lists features in ScrapeBox can be re-purposed to do this and scrub email lists, and the answer is yes.

    In ScrapeBox v2 you can load the emails in to the harvester grid, and use "Remove URL's containing entries from". Here i had 160 random scraped emails which had a few staples.com and .govt.nz emails

    [​IMG]

    Then select a text file with the domains, extensions, or any text like [email protected] to remove any emails containing these strings

    [​IMG]

    And i was left with 123 emails without any from staples.com or .govt.nz domains. The ScrapeBox email scraper can do this filtering in real time, but this is how you can do it after the fact if you already have a list you want to scrub. You could also do this with keywords lists instead of emails or urls to filter out low paying or bad keywords etc.
     
  6. qrazy

    qrazy Senior Member

    Joined:
    Mar 19, 2012
    Messages:
    1,115
    Likes Received:
    1,725
    Location:
    Banana Republic
  7. Roger1979

    Roger1979 Newbie

    Joined:
    Apr 4, 2015
    Messages:
    12
    Likes Received:
    1
    Sweet funny, thank you for the info. How much does that app costs?

    qrazy, thanks a bunch man. I will try that app and let you know how it went!

    edit: qrazy : The application was unable to start correctly (0xc000007b). Click OK to close the application.
    I've also tried to run it as Administrator. Same error. I'm running Windows 7 x64. Maybe this is the problem? Are there any requirements for running this app? Thanks!

    edit 2: I'll hit the bed. Talk soon.
    :drinking2
     
    Last edited: May 8, 2015
  8. qrazy

    qrazy Senior Member

    Joined:
    Mar 19, 2012
    Messages:
    1,115
    Likes Received:
    1,725
    Location:
    Banana Republic
    It requires .NET 4.5 framework
     
  9. Sweetfunny

    Sweetfunny Jr. VIP Jr. VIP

    Joined:
    Jul 13, 2008
    Messages:
    1,793
    Likes Received:
    5,073
    Location:
    ScrapeBox v2.0
    Home Page:
    Last edited: May 8, 2015
  10. HoNeYBiRD

    HoNeYBiRD Jr. VIP Jr. VIP

    Joined:
    May 1, 2009
    Messages:
    7,313
    Likes Received:
    8,280
    Gender:
    Male
    Occupation:
    Geographer, Tourism Manager
    Location:
    Ghosted
    you don't need any programming knowledge to do this with notepad++

    1. separate the 1000 domains with an "|" (Alt+W):
    use search and replace -> replace tab, make sure that regexp or extended is selected, assuming you have a domain in each row:
    domain1
    domain2
    domain3

    ...
    search for: \n\r
    replace with: | (Alt+W)

    if you have your domains separated by a comma and an empty space:
    domain1, domain2, domain3...
    search for: , (a comma and an empty space)
    replace with: | (Alt+W)

    copy the result to the clipboard, which should be a long, single line
    domain1|domain2|domain3...

    2. now go to the highlight tab of the search and replace popup window, tick the "highlight rows" box, make sure that regexp is selected
    search for: paste your copied content from above
    it will highlight/bookmark all the rows which contain any of the domains
    then on the main search tab -> bookmarks/delete bookmarked rows depending

    disclaimer: i use notepad++ on my native language, so maybe the names of some of the functions are sligthly different, but you got the picture i think

    actually it's harder to explain than it's done :)
     
    Last edited: May 8, 2015
  11. botrockets

    botrockets Regular Member

    Joined:
    Mar 16, 2013
    Messages:
    355
    Likes Received:
    550
    Gender:
    Male
    Occupation:
    Entrepreneur
    Location:
    BotRockets
    • Thanks Thanks x 1
    Last edited: May 8, 2015
  12. Roger1979

    Roger1979 Newbie

    Joined:
    Apr 4, 2015
    Messages:
    12
    Likes Received:
    1
    I want to thank you all very much for helping me with this issue. Before going to bed I managed to get the job done with ListMotor free trial ($150 is too pricey though), and I will probably buy soon Scrapebox. Until then, if I will ever need help with something like this I will use qrazy's or botrocket's tool.

    Great community.
     
  13. SEOking222

    SEOking222 Jr. VIP Jr. VIP

    Joined:
    Sep 29, 2012
    Messages:
    1,394
    Likes Received:
    165
    Occupation:
    LOL ya right
    Location:
    Ghosted