A Model of Vertical Crawler Based on Hidden Markov Chain
Abstract: The large size and
the dynamic nature of the Web make it necessary to continually maintain Web
based information retrieval systems. In order to get more objects by visiting
few irrelevant web pages, the web crawler usually takes the heuristic searching
strategy that ranks urls by their importance and preferentially visits the more
important web pages. While some systems rely on crawlers that exhaustively
crawl the Web, others incorporate “focus” within their crawlers to harvest
application or topic-specific collections. In this paper, using the Hidden
Markov Model(HMM) learning ability to solve the problem of the theme of the
crawler drift, has obtained the certain effect.
Author: Ye Hu, Jun Tu, Wangyu
Tong
Journal Code: jptkomputergg140104

Artikel Terkait :
Jp Teknik Komputer dd 2014
- Development of Wireless Electric Field Mill for Atmospheric Electric Field Observation
- A Novel Intrusion Detection Approach using MultiKernel Functions
- Process Improvement of LSA for Semantic Relatedness Computing
- An Algorithm Based on Wavelet Neural Network for Garment Size Selection
- Face Recognition Using Invariance with a Single Training Sample
- Dynamic DEMATEL Group Decision Approach Based on Intuitionistic Fuzzy Number
- A Reliable Web Services Selection Method for Concurrent Requests
- OWLS-CSM: A Service Profile Based Similarity Framework for Web Service Discovery
- Low Energy Adaptive Clustering Hierarchy Routing Protocol for Wireless Sensor Network
- Trusted Node-Based Algorithm to Secure Home Agent NATed IPv4 Network from IPv6 Routing Header Attacks
- Wireless Sensor Based Hybrid Architecture for Vehicular Ad hoc Networks
- Feature Selection Method Based on Improved Document Frequency
- Complex Optimization Problems Using Highly Efficient Particle Swarm Optimizer
- Image Fuzzy Enhancement Based on Self-Adaptive Bee Colony Algorithm
- Application of Chaotic Particle Swarm Optimization in Wavelet Neural Network
- Image Deblurring Via an Adaptive Dictionary Learning Strategy
- Multi-objective Optimization Based on Improved Differential Evolution Algorithm
- Unambiguous Acquisition for Galileo E1 OS Signal Based on Delay and Multiply
- Non-Planar MOSFET Modeling with Analytical Approach
- Pre-Timed and Coordinated Traffic Controller Systems Based on AVR Microcontroller
- Cooperative Avoidance Control-based Interval Fuzzy Kohonen Networks Algorithm in Simple Swarm Robots
- Windows Communication Foundation for Banyumas Tourism and Culinary Information System
- The use of ON-OFF and ANN Controllers for Automated Irrigation System Model Based on Penman-Monteith Evapotranspiration
- Cost Forecasting Model of Transmission Project based on the PSO-BP Method
- A New Algorithm for Detecting Local Community Based on Random Walk