Data Mining

Data mining software is one of several kinds of tools for analyzing data. It lets users examine data from many different dimensions or angles, categorize it, and summarize the relationships identified. Technically, data mining is the process of finding correlations or patterns among dozens of fields in large relational databases.
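As a rough illustration of "finding correlations among fields", the sketch below computes the Pearson correlation for every pair of fields in a small in-memory table and reports the strongest pair. The field names (ad_spend, visits, returns) and the values are made up for the example, and Pearson correlation is only one of many measures a data mining tool might use.

```python
from itertools import combinations
from math import sqrt


def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length numeric lists."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / sqrt(var_x * var_y)


# Hypothetical rows, standing in for records pulled from a relational table.
records = [
    {"ad_spend": 10, "visits": 120, "returns": 7},
    {"ad_spend": 20, "visits": 205, "returns": 3},
    {"ad_spend": 30, "visits": 310, "returns": 8},
    {"ad_spend": 40, "visits": 390, "returns": 4},
    {"ad_spend": 50, "visits": 510, "returns": 6},
]

fields = list(records[0])

# Correlate every pair of fields, column against column.
correlations = {
    (a, b): pearson([r[a] for r in records], [r[b] for r in records])
    for a, b in combinations(fields, 2)
}

# The pair with the largest absolute correlation is the strongest relationship.
strongest = max(correlations, key=lambda pair: abs(correlations[pair]))

for pair, value in correlations.items():
    print(pair, round(value, 3))
print("strongest:", strongest)
```

With real data the rows would come from a database query rather than a literal list, and a strong correlation would still need checking before it is treated as a meaningful relationship.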

dexlock

Apache Nutch

Apache Nutch is an open source Java implementation of a search engine. It provides all of the tools you need to run your own search engine. Because Nutch is open source, its ranking algorithms are open to inspection. Nutch can add search to heterogeneous types of information, and plugins can be used to add further functionality.

Scrapy

Scrapy is an open source and collaborative framework for extracting the data you need from websites in a fast, simple, yet extensible way. You write the rules for extracting the data, and Scrapy applies them to the pages it crawls. Scrapy is also extensible by design: new functionality can be added easily without touching the core.

Beautiful Soup

Beautiful Soup is a Python library for pulling data out of HTML and XML files. It provides ways of navigating, searching, and modifying the parse tree.
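The three operations mentioned above can be sketched as follows, assuming the bs4 package is installed and using Python's built-in html.parser backend. The HTML snippet and its link targets are made up for the example.

```python
from bs4 import BeautifulSoup

# Made-up document; the hrefs are placeholders, not real pages.
html = """<html><body>
<h1>Tools</h1>
<ul>
  <li><a href="/nutch">Apache Nutch</a></li>
  <li><a href="/scrapy">Scrapy</a></li>
</ul>
</body></html>"""

soup = BeautifulSoup(html, "html.parser")

# Navigating: attribute access walks down to the first matching tag.
title = soup.h1.get_text()

# Searching: find_all returns every matching tag in document order.
links = [(a.get_text(), a["href"]) for a in soup.find_all("a")]

# Modifying: assigning to .string replaces the tag's text in the parse tree.
soup.h1.string = "Data Mining Tools"
new_title = soup.h1.get_text()

print(title, links, new_title)
```

Other parser backends such as lxml can be passed in place of "html.parser" when they are installed, and they may handle malformed markup differently.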