Similar presentations:
Blog Spider
1.
Blog SpiderSerhii Lukashov
2.
IntroducingBlogSpider is a website project that allows user to crawl pages find on them RSS channels and store.
The main goal of the project was to learn new technologies and dive into the AKKA.net.
3.
Project structureProject consists of four main parts:
Lighthouse
Crawler
Tracker
Web application
4.
Base crawling alghoritmHere you can see base idea of
crawling alghoritm
5.
Base concept of crawler clusterHere you can see basic roles wich must be in
crawler cluster.
Web - web application wich run
some job to crawl.
Tracker - this service which tell what we nee to crawl
6.
What is lighthouse?Lighthouse is a dedicated seed nodes tool for our cluster. It only has to be operate one occur cluster itself
is upgraded and it’s not actually deployed as part of your application, so it should never have to be
redeployed when you make no changes but it will need to be upgraded as occur that cluster gets
upgraded.
7.
Let`s look how it work8.
Project conclusionsAt this project i`ve got a lot of knoledges about implementing cluster by using Akka.net also learn such
frameworks as Topshelf, Quartz.net, SinglaR