Marc Najork
Research Engineering Director, Google LLC
Verified email at google.com
Title, Cited by, Year
Detecting spam web pages through content analysis
A Ntoulas, M Najork, M Manasse, D Fetterly
Proceedings of the 15th international conference on World Wide Web, 83-92, 2006
Cited by 776, 2006
Mercator: A scalable, extensible web crawler
A Heydon, M Najork
World Wide Web 2 (4), 219-229, 1999
Cited by 751, 1999
A large‐scale study of the evolution of Web pages
D Fetterly, M Manasse, M Najork, JL Wiener
Software: Practice and Experience 34 (2), 213-237, 2004
Cited by 732, 2004
Breadth-first crawling yields high-quality pages
M Najork, JL Wiener
Proceedings of the 10th international conference on World Wide Web, 114-118, 2001
Cited by 557, 2001
Spam, damn spam, and statistics: Using statistical analysis to locate spam web pages
D Fetterly, M Manasse, M Najork
Proceedings of the 7th International Workshop on the Web and Databases …, 2004
Cited by 426, 2004
Web crawling
C Olston, M Najork
Foundations and Trends® in Information Retrieval 4 (3), 175-246, 2010
Cited by 391, 2010
On near-uniform URL sampling
MR Henzinger, A Heydon, M Mitzenmacher, M Najork
Computer Networks 33 (1-6), 295-308, 2000
Cited by 324, 2000
Boxwood: Abstractions as the Foundation for Storage Infrastructure.
J MacCormick, N Murphy, M Najork, CA Thekkath, L Zhou
OSDI 4, 8-8, 2004
Cited by 253, 2004
On the evolution of clusters of near-duplicate web pages
D Fetterly, M Manasse, M Najork
Proceeding of the 1st Latin American Web Congress, 37-45, 2003
Cited by 192, 2003
Measuring index quality using random walks on the Web
MR Henzinger, A Heydon, M Mitzenmacher, M Najork
Computer Networks 31 (11-16), 1291-1303, 1999
Cited by 192, 1999
High-performance web crawling
M Najork, A Heydon
Handbook of massive data sets, 25-45, 2002
Cited by 180, 2002
Detecting phrase-level duplication on the world wide web
D Fetterly, M Manasse, M Najork
Proceedings of the 28th annual international ACM SIGIR conference on …, 2005
Cited by 172, 2005
Social network recommended content and recommending members for personalized search results
T Harrington, R Shenoy, M Najork, R Panigrahy
US Patent App. 13/252,215, 2013
Cited by 160, 2013
System and method for associating an extensible set of data with documents downloaded by a web crawler
MA Najork, CA Heydon
US Patent 6,351,755, 2002
Cited by 158, 2002
Systems and methods for ranking documents based upon structurally interrelated information
MA Najork
US Patent 7,739,281, 2010
Cited by 144, 2010
Web crawler system using plurality of parallel priority level queues having distinct associated download priority levels for prioritizing document downloading and maintaining …
MA Najork, CA Heydon, JL Wiener
US Patent 6,263,364, 2001
Cited by 143, 2001
A sketch-based distance oracle for web-scale graphs
A Das Sarma, S Gollapudi, M Najork, R Panigrahy
Proceedings of the third ACM international conference on Web search and data …, 2010
Cited by 133, 2010
Efficient URL caching for world wide web crawling
AZ Broder, M Najork, JL Wiener
Proceedings of the 12th international conference on World Wide Web, 679-689, 2003
Cited by 125, 2003
Algorithm animation using 3d interactive graphics
MH Brown, MA Najork
Proceedings of the 6th annual ACM symposium on User interface software and …, 1993
Cited by 116, 1993
HITS on the Web: How does it Compare?
MA Najork, H Zaragoza, MJ Taylor
Proceedings of the 30th annual international ACM SIGIR conference on …, 2007
Cited by 98, 2007