Stefano Tommesani


A software to stand out


Standing out from the pack starts with being visible and being noticed by the right group of professionals. No matter how good your profile is, it gets lost in a sea of similar profiles, so you need to show up and start attracting visits to your profile page.

Suppose we are interested in working at Ferrari. Let's search for professionals working there:

[Image: AttackInSearch]

Then let the spider do its job, hunting for profiles similar to the ones in the search. At any time, you can view the full list of profiles visited during this run, with all the relevant pieces of information:

[Image: AttackInLog]

The problem with generic spiders is that only a very limited number of profiles can be viewed each day, so the search algorithm must be designed specifically for this task: it has to focus on visiting the most relevant profiles and avoid wasting views on the same profile over time.
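The constraint above (a small daily budget of views, most relevant profiles first, no repeat visits) can be illustrated with a budget-limited best-first search. This is only a minimal sketch and not the algorithm used by the software described here, which the article does not show. The names `crawl`, `neighbors`, `score`, `daily_budget` and `already_seen` are all hypothetical, and the relevance score is assumed to be supplied by the caller.

```python
import heapq
from typing import Callable, Iterable, List, Optional, Set


def crawl(
    seeds: Iterable[str],
    neighbors: Callable[[str], Iterable[str]],  # hypothetical: profiles linked from a profile
    score: Callable[[str], float],              # hypothetical: higher means more relevant
    daily_budget: int,
    already_seen: Optional[Set[str]] = None,    # profiles viewed on previous days
) -> List[str]:
    """Visit at most daily_budget profiles, most relevant first, never repeating one."""
    seen = set(already_seen or ())
    heap = []    # entries are (-score, insertion order, profile id)
    order = 0    # tie-breaker so equal scores are served first-in, first-out

    def push(profile: str) -> None:
        nonlocal order
        if profile not in seen:
            # heapq is a min-heap, so negate the score to pop the best profile first
            heapq.heappush(heap, (-score(profile), order, profile))
            order += 1

    for seed in seeds:
        push(seed)

    visited: List[str] = []
    while heap and len(visited) < daily_budget:
        _, _, profile = heapq.heappop(heap)
        if profile in seen:
            continue  # queued more than once; do not spend a second view on it
        seen.add(profile)
        visited.append(profile)
        for other in neighbors(profile):
            push(other)
    return visited
```

Passing the set of profiles viewed on earlier days as `already_seen` is what keeps the spider from spending views on the same profile over time; persisting that set between runs is left out of the sketch.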

As the search progresses, we can check stats about the profiles analyzed. Unsurprisingly, most of the people profiled work at Ferrari, but the runner-up positions go to similar companies (Ducati and Maserati) and to the area between Modena and Milan:

[Image: AttackInStats]

Last Updated on Saturday, 27 January 2018 15:12  
