Sartre
Ignored a dispute resolution thread against them.
- Apr 1, 2010
- 1,672
- 4,573
1. I'm scraping Google using a f*** ton of 64-core servers for a very long time 24/7Great read @Sartre.
I wanted to pick your brain on a couple of details.
1. How did you acquire this database and where could I get one?
2. Can you elaborate more on your processs in relation to analyzing topical relevancy to the keyword / snippet - using spacy or something similar I take it?
3. If so, is it in reach for you to share a code snippet of just this?
4. I hope this makes sense, but how would the score from your app be represented - for a keyword with multiple UGC's (user gen content) would you calculatae in the sense that you decrease the difficulty score for each of UGC on the front page?
5. I see, so you don't use AI generated content on them?
Thanks.
2. I covered this in detail somewhere in this thread.
3. My app is over 10k lines of code. Unfortunately can't just share a snippet, as it depends on my custom classes and functions.
4. Yeah, exactly I decrease difficulty for each UGC, and the higher in position a particular link is, the higher relative power it has. I took the calculations from CTR in Google. It's mostly a function of inverse Φ - a 38% reduction for every position. If I remember correctly.
5. I use only AI generated content from now on. I also revamped almost everything when it comes to article generation after the last update. I will write an update on this soon.
well, it's much more complex than just that at this point, so I really can't help you. Especially after the changes since the last Google update.Not, to find relevant sentences/paragraphs in Google top articles
By the way, if it's not difficult, tell me what coefficient you use to remove duplicates