Editing a big list, how to remove certain URLs?

dgfalk

Power Member
Joined
Apr 26, 2010
Messages
687
Reaction score
95
I have a big list of URLs that I want to sort out. About 1/4 of the urls have a specific keyword in the title that I want to remove. Is there anyway in Word or another program that I can say any URL that contains this "keyword" remove from the list?
 
Just Made a Quick Video on How to do this in Notepad++ (Watch in HD 1080p)

http://www.youtube.com/watch?v=TrcTcw_yipE

It will take a whole lot of time if your list size is bigger than 10 Megabyte. Alternatively you can do this easily in scrapebox. If this doesn't works for you, send it to me. I'll Process it within few seconds with my Custom Text Editor! :)
 
If you have any version of XL since 2003/5 or so you can use AutoFilter (toolbar button looks like a funnel with an equal sign) and say "contains" or "does not contain"
 
I do it in regex with notepad++.

Go to Search -> Replace , now make sure the Regular Expressions radio button is checked

If you have a keyword, for example KEYWORD1, type the following into the find box: ^.*KEYWORD1.*$

(In Regex ^ means beginning of line, $ means end of line, and .* means any old text)

and in the Replace box, leave it blank

Click Replace All

After the lines containing KEYWORD1 are deleted, press CTRL+A to select all, then go to TextFx -> TextFx Edit -> Delete Blank lines

It's quick and it works, although others may have different methods. I generally use this to clean up my harvested URL lists - even with the best of footprints for some reason I get a shitload of blogspot blogs which need to be eliminated! Peace out!
 
Back
Top