-
Notifications
You must be signed in to change notification settings - Fork 0
matteorr/MultiThreadWebCrawler
Folders and files
Name | Name | Last commit message | Last commit date | |
---|---|---|---|---|
Repository files navigation
/* * multithreadcrawler.cpp * * Created on: Feb 2013 * Author: Matteo Ruggero Ronchi * * Copyright 2013 Matteo Ruggero Ronchi * * Licensed under the Apache License, Version 2.0 (the "License"); * you may not use this file except in compliance with the License. * You may obtain a copy of the License at * * http://www.apache.org/licenses/LICENSE-2.0 * * Unless required by applicable law or agreed to in writing, software * distributed under the License is distributed on an "AS IS" BASIS, * WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. * See the License for the specific language governing permissions and * limitations under the License. */ /* * Like any free software license, the Apache License allows the user of the software * the freedom to use the software for any purpose, to distribute it, to modify it, * and to distribute modified versions of the software, under the terms of the license. * * The Apache License, does not require modified versions of the software * to be distributed using the same license (in contrast to copyleft licenses). * * In every licensed file, any original copyright, patent, trademark, * and attribution notices in redistributed code must be preserved * (excluding notices that do not pertain to any part of the derivative works), * and, in every licensed file changed, a notification must be added stating that changes have been made to that file. * * If a NOTICE text file is included as part of the distribution of the original work, * then derivative works must include a readable copy of these notices in at least one of three places: * - within a NOTICE text file distributed as part of the derivative works * - within the source form or documentation * - within a display generated by the derivative works (wherever such third-party notices normally appear). * * The contents of the NOTICE file do not modify the license, as they are for informational purposes only. * Modifications may have appropriate copyright notices, and may provide different license terms for the modifications. * */ /* * * Feel free to contact me for any kind of explanation request, suggestion, or bug individuation: * * - email address: [email protected] * - skype account: matteorr * */ /* * To execute program please use following instruction: * * ./crawler * * you need to put some html documents in the directory "html_docs" to begin crawling. Execution will result in the crawler accessing links he finds in the html docs, fecthing them and parsing them for * finding new links. * */
About
A C++ Multi Thread Implementation of a Web Crawler
Resources
Stars
Watchers
Forks
Releases
No releases published
Packages 0
No packages published