Journal articles on the topic 'Web page'

Consult the top 50 journal articles for your research on the topic 'Web page.'

Browse journal articles on a wide variety of disciplines and organise your bibliography correctly.

1

Lei, Shi. "Modeling an web community discovery method with web page attraction." Journal of Intelligent & Fuzzy Systems 40, no. 6 (2021): 11159–69. http://dx.doi.org/10.3233/jifs-202366.

Abstract:
An improved Web community discovery algorithm is proposed in this paper based on the attraction between Web pages to effectively reduce the complexity of Web community discovery. The proposed algorithm treats each Web page in the Web pages collection as an individual with attraction based on the theory of universal gravitation, elaborates the discovery and evolution process of Web community from a Web page in the Web pages collection, defines the priority rules of Web community size and Web page similarity, and gives the calculation formula of the change in Web page similarity. Finally, an exp
2

Apandi, Siti Hawa, Jamaludin Sallim, Rozlina Mohamed, and Norkhairi Ahmad. "Automatic Topic-Based Web Page Classification Using Deep Learning." JOIV : International Journal on Informatics Visualization 7, no. 3-2 (2023): 2108. http://dx.doi.org/10.30630/joiv.7.3-2.1616.

Abstract:
The internet is frequently surfed by people by using smartphones, laptops, or computers in order to search information online in the web. The increase of information in the web has made the web pages grow day by day. The automatic topic-based web page classification is used to manage the excessive amount of web pages by classifying them to different categories based on the web page content. Different machine learning algorithms have been employed as web page classifiers to categorise the web pages. However, there is lack of study that review classification of web pages using deep learning. In
3

Klushyn, Y., and Y. Zakharchin. "Increase the Speed of Web Applications." Computer systems and network 2, no. 1 (2017): 33–43. http://dx.doi.org/10.23939/csn2020.01.033.

Abstract:
The article presents a method of creating a web application based on SPA technology (one-page web application), as a method of increasing the speed of web applications based on the use of modern frameworks, tools and tools for developing client and server part of a one-page web application. One-page web applications are web application technologies that consist of a single web page that interacts with the user, dynamically generating the current page rather than downloading entire new pages from the server. Based on this technique, we developed our own web application and based on it we determ
4

Chen, Yuanchao, Yuliang Lu, Zulie Pan, et al. "APIMiner: Identifying Web Application APIs Based on Web Page States Similarity Analysis." Electronics 13, no. 6 (2024): 1112. http://dx.doi.org/10.3390/electronics13061112.

Abstract:
Modern web applications offer various APIs for data interaction. However, as the number of these APIs increases, so does the potential for security threats. Essentially, more APIs in an application can lead to more detectable vulnerabilities. Thus, it is crucial to identify APIs as comprehensively as possible in web applications. However, this task faces challenges due to the increasing complexity of web development techniques and the abundance of similar web pages. In this paper, we propose APIMiner, a framework for identifying APIs in web applications by dynamically traversing web pages base
5

Apandi, Siti Hawa, Jamaludin Sallim, and Rozlina Mohamed. "A Convolutional Neural Network (CNN) Classification Model for Web Page: A Tool for Improving Web Page Category Detection Accuracy." JITSI : Jurnal Ilmiah Teknologi Sistem Informasi 4, no. 3 (2023): 110–21. http://dx.doi.org/10.30630/jitsi.4.3.181.

Abstract:
Game and Online Video Streaming are the most viewed web pages. Users who spend too much time on these types of web pages may suffer from internet addiction. Access to Game and Online Video Streaming web pages should be restricted to combat internet addiction. A tool is required to recognise the category of web pages based on the text content of the web pages. Due to the unavailability of a matrix representation that can handle long web page text content, this study employs a document representation known as word cloud image to visualise the words extracted from the text content web page after
6

Apandi, Siti Hawa, Jamaludin Sallim, and Rozlina Mohamed. "A Convolutional Neural Network (CNN) Classification Model for Web Page: A Tool for Improving Web Page Category Detection Accuracy." JITSI : Jurnal Ilmiah Teknologi Sistem Informasi 4, no. 3 (2023): 110–21. https://doi.org/10.62527/jitsi.4.3.181.

Abstract:
Game and Online Video Streaming are the most viewed web pages. Users who spend too much time on these types of web pages may suffer from internet addiction. Access to Game and Online Video Streaming web pages should be restricted to combat internet addiction. A tool is required to recognise the category of web pages based on the text content of the web pages. Due to the unavailability of a matrix representation that can handle long web page text content, this study employs a document representation known as word cloud image to visualise the words extracted from the text content web page after
7

Nandanwar, Amit Kumar, and Jaytrilok Choudhary. "Semantic Features with Contextual Knowledge-Based Web Page Categorization Using the GloVe Model and Stacked BiLSTM." Symmetry 13, no. 10 (2021): 1772. http://dx.doi.org/10.3390/sym13101772.

Abstract:
Internet technologies are emerging very fast nowadays, due to which web pages are generated exponentially. Web page categorization is required for searching and exploring relevant web pages based on users’ queries and is a tedious task. The majority of web page categorization techniques ignore semantic features and the contextual knowledge of the web page. This paper proposes a web page categorization method that categorizes web pages based on semantic features and contextual knowledge. Initially, the GloVe model is applied to capture the semantic features of the web pages. Thereafter, a Stack
8

Limna, Das P., and Sanjeetha R. "Improving Web Information Quality by Age Detection & Security Analysis of Web Pages." International Journal of Advances in Engineering & Scientific Research 1, no. 4 (2014): 28–34. https://doi.org/10.5281/zenodo.10720350.

Abstract:
The Web is evolving very rapidly due to the ease of publishing information. At the same time, the Web is vulnerable to time passage as much new content is created continuously and old content becomes quickly obsolete. It is thus important to distinguish fresh and obsolete content in Web pages. Many web pages contain elements inserted at different time points. Some pages show timestamps or other temporal metadata informing about th
9

Li, Xin Li. "Web Page Ranking Algorithm Based on the Meta-Information." Applied Mechanics and Materials 596 (July 2014): 292–96. http://dx.doi.org/10.4028/www.scientific.net/amm.596.292.

Abstract:
PageRank algorithms only consider hyperlink information, without other page information such as page hits frequency, page update time and web page category. Therefore, the algorithms rank a lot of advertising pages and old pages pretty high and can’t meet the users' needs. This paper further studies the page meta-information such as category, page hits frequency and page update time. The Web page with high hits frequency and with smaller age should get a high rank, while the above two factors are more or less dependent on page category. Experimental results show that the algorithm has good res
10

Zhao, Wenjuan, and Zhongbao Liu. "Research on Web Page Classification Method based on Newton’s Law of Universal Gravitation and HITS Algorithm." Advances in Engineering Technology Research 13, no. 1 (2025): 856. https://doi.org/10.56028/aetr.13.1.856.2025.

Abstract:
Web page classification is one of the most important methods in web mining. In recent years, numerous classifiers are proposed and used for web page classification. Though these classifiers perform well in practice, they don't pay enough attention to the link connections between web pages, and therefore, their classification efficiencies can’t be greatly improved. We propose a web page classification method based on Newton’s Law of Universal Gravitation and HITS (Hypertext-Induced Topic Search) algorithm (WPCM), based on which, we constructs the web page classification system. In this system,
11

Arase, Yuki, Takahiro Hara, Toshiaki Uemukai, and Shojiro Nishio. "Annotation and Auto-Scrolling for Web Page Overview in Mobile Web Browsing." International Journal of Handheld Computing Research 1, no. 4 (2010): 63–80. http://dx.doi.org/10.4018/jhcr.2010100104.

Abstract:
Due to advances in mobile phones, mobile Web browsing has become increasingly popular. In this regard, small screens and poor input capabilities of mobile phones prevent users from comfortably browsing Web pages that are designed for desktop PCs. One of the serious problems of mobile Web browsing is that users often get lost in a Web page and can only view a small portion of a Web page at a time, not able to grasp the entire page’s structure to decide which direction their information of interest is located. To solve this problem, an effective technique is to present an overview of the page. A
12

Agyapong, Kwame Boakye, J. B. Hayfron-Acquah, and M. Asante. "An Optimized Page Rank Algorithm with Web Mining, Web Content Mining and Web Structure Mining." International Journal of Engineering Technologies and Management Research 4, no. 8 (2017): 22–27. https://doi.org/10.5281/zenodo.914660.

Abstract:
With the rapid increase in internet technology, users get easily confused in large hypertext structure. The primary goal of the web site owner is to provide the relevant information to the users to fulfill their needs. In order to achieve this goal, they use the concept of web mining. Web mining is used to categorize users and pages by analyzing the users' behaviour, the content of the pages, and the order of the URLs that tend to be accessed in order. Most of the search engines are ranking their search results in response to users' queries to make their search navigation easier. W
13

Satish Babu, J., T. Ravi Kumar, and Shahana Bano. "Optimizing webpage relevancy using page ranking and content based ranking." International Journal of Engineering & Technology 7, no. 2.7 (2018): 1025. http://dx.doi.org/10.14419/ijet.v7i2.7.12220.

Abstract:
Systems for web information mining can be isolated into a few classifications as indicated by a sort of mined data and objectives that specific classifications set: Web structure mining, Web utilization mining, and Web Content Mining. This paper proposes another Web Content Mining system for page significance positioning taking into account the page content investigation. The strategy, we call it Page Content Rank (PCR) in the paper, consolidates various heuristics that appear to be critical for breaking down the substance of Web pages. The page significance is resolved on the base of the sig
14

Farooqi, Muneeb Ahmed, Muhammad Arslan Ashraf, and Muhammad Umer Shaukat. "Google Page Rank Site Structure Strategies for Marketing Web Pages." Journal of Computing & Biomedical Informatics 2, no. 02 (2021): 140–57. http://dx.doi.org/10.56979/202/2021/30.

Abstract:
There are several Search Engines to categorize the web content and show us on base of our search query. These search engines are continuously visiting the pages/sites and gather the information using different techniques called crawling/spidering. On basis of daily content collection all search engines are managing their own indexes for searches. For every business there is a need to make its pages as most top rated/ranked pages by making structurally and content wise batter so that any crawler can easily crawl it and can rank it among the top 10 results. In this thesis only, structural behavior
15

Kapusta, Jozef, Michal Munk, and Martin Drlik. "Website Structure Improvement Based on the Combination of Selected Web Structure and Web Usage Mining Methods." International Journal of Information Technology & Decision Making 17, no. 06 (2018): 1743–76. http://dx.doi.org/10.1142/s0219622018500402.

Abstract:
The different web mining methods and techniques can help to solve some typical issues of the contemporary websites, contribute to more effective personalization, improve a website structure and reorganize its web pages. However, only several papers tried to combine web structure and web usage mining (WUM) methods with this aim. The paper researches if and how the combination of selected web structure and WUM methods can identify misplaced web pages and how they can contribute to improving the website structure. The paper analyzes the relationship between the estimated importance of the web pag
16

Grigalis, Tomas, and Antanas Čenys. "Unsupervised Structured Data Extraction from Template-generated Web Pages." JUCS - Journal of Universal Computer Science 20, no. 2 (2014): 169–92. https://doi.org/10.3217/jucs-020-02-0169.

Abstract:
This paper studies structured data extraction from template-generated Web pages. Such pages contain most of structured data on the Web. Extracted structured data can be later integrated and reused in very big range of applications, such as price comparison portals, business intelligence tools, various mashups and etc. It encourages industry and academics to seek automatic solutions. To tackle the problem of automatic structured Web data extraction we present a new approach - structured data extraction based on clustering visually similar Web page elements. Our method called ClustVX combines vi
17

Putra Eka Prismana, Gusti Lanang. "Automatic Web News Content Extraction." Journal Research of Social Science, Economics, and Management 1, no. 7 (2022): 785–94. http://dx.doi.org/10.36418/jrssem.v1i7.107.

Abstract:
The extraction of the main content of web pages is widely used in search engines, but a lot of irrelevant information, such as advertisements, navigation, and junk information, is included in web pages. Such irrelevant information reduces the efficiency of web content processing in content-based applications. This study aimed to extract web pages using DOM Tree in the rationality of segmentation results and efficiency based on the information entropy of nodes from the DOM Tree. The first step of this research was to classify web page tags and only processed tags that affected the structure of
18

Putra Eka Prismana, Gusti Lanang. "Automatic Web News Content Extraction." Journal Research of Social Science, Economics, and Management 1, no. 7 (2022): 785–94. http://dx.doi.org/10.59141/jrssem.v1i7.107.

Abstract:
The extraction of the main content of web pages is widely used in search engines, but a lot of irrelevant information, such as advertisements, navigation, and junk information, is included in web pages. Such irrelevant information reduces the efficiency of web content processing in content-based applications. This study aimed to extract web pages using DOM Tree in the rationality of segmentation results and efficiency based on the information entropy of nodes from the DOM Tree. The first step of this research was to classify web page tags and only processed tags that affected the structure of
19

Meara, J. "Web page." Age and Ageing 32, no. 3 (2003): 355. http://dx.doi.org/10.1093/ageing/32.3.355.

20

Lingaraju, G. M., and S. Jagannatha. "Review of Web Page Classification and Web Content Mining." Journal of Advanced Research in Dynamical and Control Systems 11, no. 10 (2019): 142–47. http://dx.doi.org/10.5373/jardcs/v11i10/20193017.

21

Mani Sekhar, S. R., G. M. Siddesh, Sunilkumar S. Manvi, and K. G. Srinivasa. "Optimized Focused Web Crawler with Natural Language Processing Based Relevance Measure in Bioinformatics Web Sources." Cybernetics and Information Technologies 19, no. 2 (2019): 146–58. http://dx.doi.org/10.2478/cait-2019-0021.

Abstract:
In the fast growing of digital technologies, crawlers and search engines face unpredictable challenges. Focused web-crawlers are essential for mining the boundless data available on the internet. Web-Crawlers face indeterminate latency problem due to differences in their response time. The proposed work attempts to optimize the designing and implementation of Focused Web-Crawlers using Master-Slave architecture for Bioinformatics web sources. Focused Crawlers ideally should crawl only relevant pages, but the relevance of the page can only be estimated after crawling the genomics pages
22

Chaithra, G. M. Lingaraju, and S. Jagannatha. "Automatic Web Page Classification System with Improved Accuracy." Webology 18, no. 2 (2021): 225–42. http://dx.doi.org/10.14704/web/v18i2/web18318.

Abstract:
Nowadays, the Internet contains a wide variety of online documents, making finding useful information about a given subject impossible, as well as retrieving irrelevant pages. Web document and page recognition software is useful in a variety of fields, including news, medicine, and fitness, research, and information technology. To enhance search capability, a large number of web page classification methods have been proposed, especially for news web pages. Furthermore existing classification approaches seek to distinguish news web pages while still reducing the high dimensionality of features
23

Om Prakash, P. G., K. Suresh Kumar, Balajee Maram, and C. Priya. "Deep Fuzzy Clustering and Deep Residual Network for Prediction of Web Pages from Weblog Data with Fractional Order Based Ranking." International Journal of Uncertainty, Fuzziness and Knowledge-Based Systems 31, no. 03 (2023): 413–36. http://dx.doi.org/10.1142/s0218488523500216.

Abstract:
Web page recommendation system has attracted more attention in recent decades. The web page recommendation has various characteristics than the classical recommenders. It is the process of predicting the request of the next web page that users are significantly interested while searching the web. It helps the users to find relevant pages in the field of web mining. In particular, web user may spend more time to identify expected information. To understand behavior of users and to visit the page based on their interest at a specific time, an effective web page recommendation method is developed
24

Gao, Xiaoying, Mengjie Zhang, and Peter Andreae. "Automatic Pattern Construction for Web Information Extraction." International Journal of Uncertainty, Fuzziness and Knowledge-Based Systems 12, no. 04 (2004): 447–70. http://dx.doi.org/10.1142/s0218488504002928.

Abstract:
This paper describes a domain independent approach for automatically constructing information extraction patterns for semi-structured web pages. Given a randomly chosen page from a web site of similarly structured pages, the system identifies a region of the page that has a regular "tabular" structure, and then infers an extraction pattern that will match the "rows" of the region and identify the data elements. The approach was tested on three corpora containing a series of tabular web sites from different domains and achieved a success rate of at least 80%. A significant strength of the syste
25

Agyapong, Kwame, J. B. Hayfron Acquah, and M. Asante. "An Optimized Page Rank Algorithm with Web Mining, Web Content Mining and Web Structure Mining." International Journal of Engineering Technologies and Management Research 4, no. 8 (2020): 22–27. http://dx.doi.org/10.29121/ijetmr.v4.i8.2017.91.

Abstract:
With the rapid increase in internet technology, users get easily confused in large hypertext structure. The primary goal of the web site owner is to provide the relevant information to the users to fulfill their needs. In order to achieve this goal, they use the concept of web mining. Web mining is used to categorize users and pages by analyzing the users' behaviour, the content of the pages, and the order of the URLs that tend to be accessed in order. Most of the search engines are ranking their search results in response to users' queries to make their search navigation easier. With a web br
26

Mahajan, Isha. "Extended Weighted Page Rank Based on VOL by Finding User Activities Time and Page Reading Time, Storing them Directly on Search Engine Database Server." International Journal of Engineering Works (ISSN:2409-2770) 4, no. 2 (2017): 41–48. https://doi.org/10.5281/zenodo.376487.

Abstract:
Searching on the web can be considered as a process of user enters the query and search system returns a set of most relevant pages in response to user’s query. But results returned are not mostly relevant to user’s query and ranking of the pages are not efficient according to user requirement. In order to improve the precision of ranking of the web pages, after analyzing the different algorithms like Page Rank, Weighted Page Rank, Page Rank based on VOL, Weighted Page Rank algorithm based on VOL. In this paper, we are proposing enhancement by including “User Activities Time” and “Page Reading
27

Li, Xingchen, Weizhe Zhang, Desheng Wang, Bin Zhang, and Hui He. "Algorithm of web page similarity comparison based on visual block." Computer Science and Information Systems 16, no. 3 (2019): 815–30. http://dx.doi.org/10.2298/csis180915028l.

Abstract:
Phishing often deceives users due to the relative similarity to the true pages on a layout and leads to considerable losses for the society. Consequently, detecting phishing sites has been an urgent activity. By researching phishing web pages using web page screenshots, we discover that this kind of web pages use numerous web page screenshots to achieve the close similarity to the true page and avoid the text and structure similarity detection. This study introduces a new similarity matching algorithm based on visual blocks. First, the RenderLayer tree of the web page is obtained to extract th
28

Sakamoto, Kohei, and Chieko Kato. "Effects on Memorized Information Quantity in Web Pages Using Bicolor Design-from the Perspective of Color Blind People and Non-Color Blind People." Indian Journal of Public Health Research & Development 11, no. 1 (2020): 1839–43. http://dx.doi.org/10.37506/ijphrd.v11i1.1389.

Abstract:
In Japan, according to the Japanese Ophthalmological Society, people with color blindness make up about 5% of men and about 0.2% of women. Colorblind people often experience inconvenient situations in their lives because they cannot distinguish colors. Web pages are no exception. Companies and governments use web pages to transmit information to people. In this study, we created a web page with a color scheme obtained from previous research. This study confirmed that colorblind people, after viewing such a web page, remembered its contents better. From these results, this study analyzed the role
29

El Louadi, Mohamed, and Imen Ben Ali. "Perceived and Actual Web Page Loading Delay." Journal of Information Technology Research 3, no. 2 (2010): 50–66. http://dx.doi.org/10.4018/jitr.2010040104.

Abstract:
The major complaint users have about using the Web is that they must wait for information to load onto their screen. This is more acute in countries where bandwidth is limited and fees are high. Given bandwidth limitations, Web pages are often hard to accelerate. Predictive feedback information is assumed to distort Internet users’ perception of time, making them more tolerant of low speed. This paper explores the relationship between actual Web page loading delay and perceived Web page loading delay and two aspects of user satisfaction: the Internet user’s satisfaction with the Web page loadi
30

Ahmad Sabri, Ily Amalina, and Mustafa Man. "Improving Performance of DOM in Semi-structured Data Extraction using WEIDJ Model." Indonesian Journal of Electrical Engineering and Computer Science 9, no. 3 (2018): 752. http://dx.doi.org/10.11591/ijeecs.v9.i3.pp752-763.

Abstract:
Web data extraction is the process of extracting user required information from web page. The information consists of semi-structured data not in structured format. The extraction data involves the web documents in html format. Nowadays, most people uses web data extractors because the extraction involve large information which makes the process of manual information extraction takes time and complicated. We present in this paper WEIDJ approach to extract images from the web, whose goal is to harvest images as object from template-based html pages. The WEIDJ (Web Extraction Image usin
31

Ahmad Sabri, Ily Amalina, and Mustafa Man. "Improving Performance of DOM in Semi-structured Data Extraction Using WEIDJ Model." Indonesian Journal of Electrical Engineering and Computer Science 9, no. 3 (2018): 752–63. https://doi.org/10.11591/ijeecs.v9.i3.pp752-763.

Abstract:
Web data extraction is the process of extracting user required information from web page. The information consists of semi-structured data not in structured format. The extraction data involves the web documents in html format. Nowadays, most people uses web data extractors because the extraction involve large information which makes the process of manual information extraction takes time and complicated. We present in this paper WEIDJ approach to extract images from the web, whose goal is to harvest images as object from template-based html pages. The WEIDJ (Web Extraction Image using DOM (Do
32

Kumar, Rahul, and Anurag Jain. "Efficient Crawling Through Dynamic Priority of Web Page in Sitemap." Efficient Crawling Through Dynamic Priority of Web Page in Sitemap 02, Jun (2014): 01–11. https://doi.org/10.5281/zenodo.1324011.

Abstract:
A web crawler or automatic indexer is used to download updated information from World Wide Web (www) for search engine. It is estimated that current size of Google index is approx 8 × 10^9 pages and crawling costs could be around 4 million dollars for a full crawl if only considered network costs. Thus we need to download only most important pages. In order toward, we propose “Efficient crawling through dynamic page priority of web pages in Sitemap” which is query based approach to inform most important pages to web crawler through sitemap protocol in dynamic page priority. Through th
33

Gupta, Renu, Ankita Shah, Amit Thakkar, and Kamlesh Makvana. "A Survey on Various Web Page Ranking Algorithms." COMPUSOFT: An International Journal of Advanced Computer Technology 05, no. 01 (2016): 2046–52. https://doi.org/10.5281/zenodo.14789798.

Abstract:
World is full of information and searching is most common task on web. As the amount of information available on web is increasing, it is difficult to acquire relevant information on web. User enters a query for retrieving required information from www and millions of web pages are fetched. These web pages or search results contain both relevant pages and irrelevant search results in response to query submitted by user. For this issue efficient Page Ranking algorithm is needed. Google uses very basic algorithm called Page Rank algorithm which uses web structure mining and has some limitations.
34

Blanco, Lorenzo, Valter Crescenzi, and Paolo Merialdo. "Structure and Semantics of Data-Intensive Web Pages: An Experimental Study on their Relationships." JUCS - Journal of Universal Computer Science 14, no. 11 (2008): 1877–92. https://doi.org/10.3217/jucs-014-11-1877.

Abstract:
In data-intensive web sites pages are generated by scripts that embed data from a backend database into HTML templates. There is usually a relationship between the semantics of the data in a page and its corresponding template. For example, in a web site about sports events, it is likely that pages with data about athletes are associated with a template that differs from the template used to generate pages about coaches or referees. This article presents a method to classify web pages according to the associated template. Given a web page, the goal of our method is to accurately find the pages
35

Papacharissi, Zizi. "The Presentation of Self in Virtual Life: Characteristics of Personal Home Pages." Journalism & Mass Communication Quarterly 79, no. 3 (2002): 643–60. http://dx.doi.org/10.1177/107769900207900307.

Abstract:
This study focused on how individuals used personal home pages to present themselves online. Content analysis was used to examine, record, and analyze the characteristics of personal home pages. Data interpretation revealed popular tools for self-presentation, a desire for virtual homesteaders to affiliate with online homestead communities, and significant relationships among home page characteristics. Web page design was influenced, to a certain extent, by the tools Web page space providers supplied. Further studies should consider personality characteristics, design templates, and Web author
36

Khalil, Nida, Saniah Rehan, Abeer Javed Syed, Khalid Mahboob, Fayyaz Ali, and Fatima Waseem. "Optimizing the Efficiency of Web Mining through Comparative Web Ranking Algorithms." VFAST Transactions on Software Engineering 11, no. 4 (2023): 105–23. http://dx.doi.org/10.21015/vtse.v11i4.1667.

Abstract:
Millions of web pages carrying massive amounts of data make up the World Wide Web. Real-time data has been generated on a wide scale on the websites. However, not every piece of data is relevant to the user. While scouring the web for information, a user may come upon a web page that contains irrelevant or incomplete information. As a response, search engines can alleviate this issue by displaying the most relevant pages. Two web page ranking algorithms are proposed in this study along with the Dijkstra algorithm; the PageRank algorithm and the Weighted PageRank algorithm. The algorithms are u
37

Jaganathan, B., and Kalyani Desikan. "Enhanced Web Page Ranking Method Using Laplacian Centrality." International Journal of Engineering & Technology 7, no. 4.10 (2018): 566. http://dx.doi.org/10.14419/ijet.v7i4.10.21282.

Abstract:
In today's era of computer technology where users want not only the most relevant data but they also want the data as quickly as possible. Hence, ranking web pages becomes a crucial task. The purpose of this research is to find a centrality measure that can be used in place of original page rank. In this article concept of Laplacian centrality measure for directed web graph has been introduced to identify the web page ranks. Comparison between the original page rank and Laplacian centrality based Page rank has been made. Kendall's correlation co-efficient has been used as a measure to find the
38

Kaur, Satinder, and Sunil Gupta. "PREDICTION OF DESIGN ASPECTS OF WEB PAGE BY HTML PARSER." International Journal of Engineering Technologies and Management Research 5, no. 2 (2020): 143–58. http://dx.doi.org/10.29121/ijetmr.v5.i2.2018.157.

Abstract:
Inform plays a very important role in life and nowadays, the world largely depends on the World Wide Web to obtain any information. Web comprises of a lot of websites of every discipline, whereas websites consists of web pages which are interlinked with each other with the help of hyperlinks. The success of a website largely depends on the design aspects of the web pages. Researchers have done a lot of work to appraise the web pages quantitatively. Keeping in mind the importance of the design aspects of a web page, this paper aims at the design of an automated evaluation tool which evaluate th
39

Massaro, Alessandro, Daniele Giannone, Vitangelo Birardi, and Angelo Maurizio Galiano. "An Innovative Approach for the Evaluation of the Web Page Impact Combining User Experience and Neural Network Score." Future Internet 13, no. 6 (2021): 145. http://dx.doi.org/10.3390/fi13060145.

Abstract:
The proposed paper introduces an innovative methodology useful to assign intelligent scores to web pages. The approach is based on the simultaneous use of User eXperience (UX), Artificial Neural Network (ANN), and Long Short-Term Memory (LSTM) algorithms, providing the web page scoring and taking into account outlier conditions to construct the training dataset. Specifically, the UX tool analyses different parameters addressing the score, such as navigation time, number of clicks, and mouse movements for page, finding possible outliers, the ANN are able to predict outliers, and the LSTM proces
40

S, JEGAJITH. "KNOWLEDGE BASED APPROACH TO DETECT POTENTIALLY RISKY WEBSITES." International Scientific Journal of Engineering and Management 04, no. 05 (2025): 1–7. https://doi.org/10.55041/isjem03381.

Abstract:
The challenge of grouping web pages inside a website so that each cluster contains a collection of web pages that can be categorised using a distinct class is known as unsupervised web page categorization. A number of requirements that would make the existing proposals for web page classification suitable for enterprise web information integration are not met, including the need to be unsupervised, which eliminates the need for a training set of pre-classified pages, to be based on lightweight crawling to prevent interfering with the website's normal operation, and to use features fr
41

Zhang, Zuping, Jing Zhao, and Xiping Yan. "A Web Page Clustering Method Based on Formal Concept Analysis." Information 9, no. 9 (2018): 228. http://dx.doi.org/10.3390/info9090228.

Abstract:
Web page clustering is an important technology for sorting network resources. By extraction and clustering based on the similarity of the Web page, a large amount of information on a Web page can be organized effectively. In this paper, after describing the extraction of Web feature words, calculation methods for the weighting of feature words are studied deeply. Taking Web pages as objects and Web feature words as attributes, a formal context is constructed for using formal concept analysis. An algorithm for constructing a concept lattice based on cross data links was proposed and was success
42

Guha, Sutirtha Kumar, Anirban Kundu, and Rana Duttagupta. "Introducing Link Based Weightage for Web Page Ranking." International Journal of Artificial Life Research 5, no. 1 (2015): 41–55. http://dx.doi.org/10.4018/ijalr.2015010103.

Abstract:
In this paper the authors are going to propose a new rank measurement technique by introducing weightage factor based on number of Web links available on a particular Web page. Available Web links are considered as an important importance indicator. Distinct weightage factor is assigned to the Web pages as these are calculated based on the Web links. Different Web pages are evaluated more accurately due to the independent and uniqueness of weightage factor. Better Web page ranking is achieved as it depends on specific weightage factor. Impact of unwanted intruder is minimized by the introducti
43

Abdulrahman, Ayad. "Web Pages Ranking Algorithms: A Survey." Qubahan Academic Journal 1, no. 3 (2021): 29–34. http://dx.doi.org/10.48161/qaj.v1n3a79.

Abstract:
Due to the daily expansion of the web, the amount of information has increased significantly. Thus, the need for retrieving relevant information has also increased. In order to explore the internet, users depend on various search engines. Search engines face a significant challenge in returning the most relevant results for a user's query. The search engine's performance is determined by the algorithm used to rank web pages, which prioritizes the pages with the most relevancy to appear at the top of the result page. In this paper, various web page ranking algorithms such as Page Rank, Time Ran
44

Lu, Houqing, Donghui Zhan, Lei Zhou, and Dengchao He. "An Improved Focused Crawler: Using Web Page Classification and Link Priority Evaluation." Mathematical Problems in Engineering 2016 (2016): 1–10. http://dx.doi.org/10.1155/2016/6406901.

Abstract:
A focused crawler is topic-specific and aims selectively to collect web pages that are relevant to a given topic from the Internet. However, the performance of the current focused crawling can easily suffer the impact of the environments of web pages and multiple topic web pages. In the crawling process, a highly relevant region may be ignored owing to the low overall relevance of that page, and anchor text or link-context may misguide crawlers. In order to solve these problems, this paper proposes a new focused crawler. First, we build a web page classifier based on improved term weighting ap
45

Xing-Hua, Lu, Ye Wen-Quan, and Liu Ming-Yuan. "Personalized Recommendation Algorithm for Web Pages Based on Association Rule Mining." MATEC Web of Conferences 173 (2018): 03020. http://dx.doi.org/10.1051/matecconf/201817303020.

Abstract:
In order to improve the user's ability to access websites and web pages, according to the interest preference of the user, the personalized recommendation design is carried out, and the personalized recommendation model for web page visit is established to meet the personalized interest demand of the user to browse the web page. A webpage personalized recommendation algorithm based on association rule mining is proposed. Based on the semantic features of web pages, user browsing behavior is calculated by similarity computation, and web crawler algorithm is constructed to extract the semantic
46

HAYASHI, Takahiro, Syo KATAHIRA, Atsushi INUZUKA, and Rikio ONAI. "Retrieval of Personal Web Pages Based on Web Page Clustering." Journal of Japan Society for Fuzzy Theory and Intelligent Informatics 18, no. 2 (2006): 161–72. http://dx.doi.org/10.3156/jsoft.18.161.

47

Prieto, Víctor, Manuel Álvarez, Víctor Carneiro, and Fidel Cacheda. "Distributed and collaborative Web Change Detection system." Computer Science and Information Systems 12, no. 1 (2015): 91–114. http://dx.doi.org/10.2298/csis131120081p.

Abstract:
Search engines use crawlers to traverse the Web in order to download web pages and build their indexes. Maintaining these indexes up-to-date is an essential task to ensure the quality of search results. However, changes in web pages are unpredictable. Identifying the moment when a web page changes as soon as possible and with minimal computational cost is a major challenge. In this article we present the Web Change Detection system that, in a best case scenario, is capable to detect, almost in real time, when a web page changes. In a worst case scenario, it will require, on average, 12 minutes
48

Prieto, Álvarez Víctor Manuel, Díaz Manuel Álvarez, Díaz Víctor Manuel Carneiro, and Seijo Fidel Cacheda. "Distributed and Collaborative Web Change Detection System." Computer Science and Information Systems 12, no. 1 (2015): 91–114. https://doi.org/10.2298/CSIS131120081P.

Abstract:
Search engines use crawlers to traverse the Web in order to download web pages and build their indexes. Maintaining these indexes up-to-date is an essential task to ensure the quality of search results. However, changes in web pages are unpredictable. Identifying the moment when a web page changes as soon as possible and with minimal computational cost is a major challenge. In this article we present the Web Change Detection system that, in a best case scenario, is capable to detect, almost in real time, when a web page changes. In a worst case scenario, it will require, on average
49

Satinder, Kaur, and Kumar Gupta Sunil. "PREDICTION OF DESIGN ASPECTS OF WEB PAGE BY HTML PARSER." International Journal of Engineering Technologies and Management Research 5, no. 2 (2018): 143–58. https://doi.org/10.5281/zenodo.1186120.

Abstract:
Inform plays a very important role in life and nowadays, the world largely depends on the World Wide Web to obtain any information. Web comprises of a lot of websites of every discipline, whereas websites consists of web pages which are interlinked with each other with the help of hyperlinks. The success of a website largely depends on the design aspects of the web pages. Researchers have done a lot of work to appraise the web pages quantitatively. Keeping in mind the importance of the design aspects of a web page, this paper aims at the design of an automated evaluation tool which
50

Scott, S. D., and Y. H. Koh. "Design Metrics and the Adaptation of Web-Page Content Chunks for PDAs." Journal of IT in Asia 1, no. 1 (2017): 35–51. http://dx.doi.org/10.33736/jita.404.2005.

Abstract:
The majority of web-pages are unsuitable for viewing on PDAs, WAP phones and similar devices without first being adapted. However, little empirical work has been done on what actually constitutes a good PDA or WAP web-page. This paper ranks a number of PDA web-pages from different categories empirically and correlates the result against the design metrics present. The findings are then compared against a similar set of experiments for PC web-pages. The results of this comparison suggest that, as well as omitting, summarizing and converting individual multimedia objects in the web-page to a les