1. This site uses cookies. By continuing to use this site, you are agreeing to our use of cookies. Learn More.

[Help] need tool to clean my txt file email list from strange characters and other things

Discussion in 'Black Hat SEO Tools' started by king master, Oct 27, 2014.

  1. king master

    king master Jr. VIP Jr. VIP Premium Member

    Joined:
    Jan 12, 2011
    Messages:
    635
    Likes Received:
    257
    Gender:
    Male
    Occupation:
    Industrial Engineer - SEO
    Location:
    Masr
    Hi dear friends , i'v email list and need to sort and clean for :

    1- bad character eg : ^tt
    2- some lines without the comma , this results email@hotmail.comsecondmail@hotmail.com
    3- some spaced lines (empty)

    #all emails for one webmail eg : hotmail

    # i want it to be clean and sorted one email per one line

    email1@hotmail.com
    email2@hotmail.com

    this may be simple , i tried in google with no luck , i didn't sleep for 20 hrs and my brain is boom

    Thanx alot :)

    # i tried to remove duplicate lines in scrapebox but i think not good enough
    # i download notpad++
     
  2. king master

    king master Jr. VIP Jr. VIP Premium Member

    Joined:
    Jan 12, 2011
    Messages:
    635
    Likes Received:
    257
    Gender:
    Male
    Occupation:
    Industrial Engineer - SEO
    Location:
    Masr
    i didn't spend 20 hrs searching for solution :) i was doing my IM work after my full time work (day job) ;

    anyway thanx for your reply
     
  3. Repulsor

    Repulsor Power Member

    Joined:
    Jun 11, 2013
    Messages:
    712
    Likes Received:
    267
    Location:
    PHP Scripting ;)
    You will probably need regex to filter out the non alphanumberic charecters, and then use some string functions to get things to the format that you are looking for.

    I highly doubt something like that exists already. If nothing else works, contact me.
     
  4. Blue_Monk

    Blue_Monk Registered Member

    Joined:
    Feb 21, 2011
    Messages:
    51
    Likes Received:
    35
    Occupation:
    If you find out i'll have to kill you.
    Location:
    UK
    Code:
    Warning It's easier than it looks and once you get the hang of it you'll have a lot of fun with it. 
    TextPipe Pro - basic idea of steps needed steps below                          
    1. Remove all blank lines                          
    2. Remove string "^tt" - duplicate this line for all strings you need deleted                          
    3. Remove string ""                           
    4. Remove string " "                           
    5. Insert line feed after all ".com" strings (copy paste this step for all TLD's you might have in there)                          
    6. Remove all blank lines again                          
    7. Remove duplicate lines ( this will remove duplicate e-mails)    
    
     
    Last edited: Oct 28, 2014
  5. misteryou.

    misteryou. Power Member

    Joined:
    Feb 1, 2012
    Messages:
    573
    Likes Received:
    109
    http://textmechanic.com/Find-and-Replace-Text.html

    1) for remove characters

    - find box: your aliens characters
    - replace with: press space for nothing

    2) correct email output

    - find box: .com
    - replace with: press space + .com

    result: email@hotmail*com secondmail@hotmail*com

    3) add a line break

    copy result here http://textmechanic.com/Add-Remove-Line-Breaks.html
    make a new line break after .com
    result:
    email@hotmail*com
    secondmail@hotmail*com

    4) remove duplicate line http://textmechanic.com/Remove-Duplicate-Lines.html
     
    • Thanks Thanks x 1
    Last edited: Oct 29, 2014
  6. linuxsmtp

    linuxsmtp Regular Member

    Joined:
    Feb 13, 2014
    Messages:
    455
    Likes Received:
    65
    Location:
    Philippines
    i have a good tools for this, one click clean