Log in

Relevant bibliographies by topics / HTML documents / Journal articles

To see the other types of publications on this topic, follow the link: HTML documents.

Journal articles on the topic 'HTML documents'

Author: Grafiati

Published: 4 June 2021

Last updated: 25 July 2025

Create a spot-on reference in APA, MLA, Chicago, Harvard, and other styles

Select a source type:

Consult the top 50 journal articles for your research on the topic 'HTML documents.'

Next to every source in the list of references, there is an 'Add to bibliography' button. Press on it, and we will generate automatically the bibliographic reference to the chosen work in the citation style you need: APA, MLA, Harvard, Chicago, Vancouver, etc.

You can also download the full text of the academic publication as pdf and read online its abstract whenever available in the metadata.

Browse journal articles on a wide variety of disciplines and organise your bibliography correctly.

1

Bonhomme, Stéphane, and Cécile Roisin. "Interactively restructuring HTML documents." Computer Networks and ISDN Systems 28, no. 7-11 (1996): 1075–84. http://dx.doi.org/10.1016/0169-7552(96)00042-6.

Full text

APA, Harvard, Vancouver, ISO, and other styles

2

Hwang, Hyun-Cheon, and Woo-Je Kim. "Design of Enhanced Document HTML and the Reliable Electronic Document Distribution Service." Electronics 12, no. 10 (2023): 2176. http://dx.doi.org/10.3390/electronics12102176.

Full text

Abstract:

Electronic documents are becoming increasingly popular in various industries and sectors as they provide greater convenience and cost-efficiency than physical documents. PDF is a widely used format for creating and sharing electronic documents, while HTML is commonly used in mobile environments as the foundation for creating web pages displayed on mobile devices, such as smartphones and tablets. HTML is becoming a more critical document format as mobile environments have been raised as the primary communication channel nowadays. However, HTML does not have the standard content integrity featur

APA, Harvard, Vancouver, ISO, and other styles

3

Kolbitsch, Josef. "Fine-Grained Transclusions of Multimedia Documents in HTML." JUCS - Journal of Universal Computer Science 11, no. (6) (2005): 926–43. https://doi.org/10.3217/jucs-011-06-0926.

Full text

Abstract:

Transclusions are a technique for virtually including existing content into new documents by reference to the original documents rather than by copying. In principle, transclusions are used in HTML for the inclusion of entire text documents, images, movies and similar media. The HTML specification only takes transclusions of entire documents into account, though. Hence it is not possible, for instance, to include a part of an existing image into an HTML document. In this paper, fine-grained transclusion of multimedia documents on the Web are proposed, which presents a logical realisation of th

APA, Harvard, Vancouver, ISO, and other styles

4

Sato, S. y. "Dynamic rewriting of HTML documents." Computer Networks and ISDN Systems 27, no. 2 (1994): 307–8. http://dx.doi.org/10.1016/s0169-7552(94)90147-3.

Full text

APA, Harvard, Vancouver, ISO, and other styles

5

von Tetzchner, J. Stephenson. "Converting formatted documents to HTML." Computer Networks and ISDN Systems 27, no. 2 (1994): 309–10. http://dx.doi.org/10.1016/s0169-7552(94)90154-6.

Full text

APA, Harvard, Vancouver, ISO, and other styles

6

Wang, Lucy Lu, Jonathan Bragg, and Daniel S. Weld. "Paper to HTML." ACM SIGACCESS Accessibility and Computing, no. 134 (October 2022): 1. http://dx.doi.org/10.1145/3582298.3582299.

Full text

Abstract:

Most scientific papers are distributed in PDF format, which is by default inaccessible to blind and low vision audiences and people who use assistive reading technology. These access barriers hinder and may even deter members of these groups from pursuing careers or opportunities that necessitate the reading of technical documents. In cases where no accessible versions of papers are made available by publishers or authors, the gold standard for PDF document accessibility is PDF remediation. Remediation is the process by which a PDF is made accessible by fixing accessibility errors, for example

APA, Harvard, Vancouver, ISO, and other styles

7

KAJI, NOBUHIRO, and MASARU KITSUREGAWA. "Acquiring Polar Sentences from HTML Documents." Journal of Natural Language Processing 15, no. 3 (2008): 77–90. http://dx.doi.org/10.5715/jnlp.15.3_77.

Full text

APA, Harvard, Vancouver, ISO, and other styles

8

Gupta, Suhit, Gail E. Kaiser, Peter Grimm, Michael F. Chiang, and Justin Starren. "Automating Content Extraction of HTML Documents." World Wide Web 8, no. 2 (2005): 179–224. http://dx.doi.org/10.1007/s11280-004-4873-3.

Full text

APA, Harvard, Vancouver, ISO, and other styles

9

O, Geum-Yong, and In-Jun Hwang. "Automatically Converting HTML Documents with Similar Pattern into XML Documents." KIPS Transactions:PartD 9D, no. 3 (2002): 355–64. http://dx.doi.org/10.3745/kipstd.2002.9d.3.355.

Full text

APA, Harvard, Vancouver, ISO, and other styles

10

Vállez, Mari, Rafael Pedraza-Jiménez, Lluís Codina, Saúl Blanco, and Cristòfol Rovira. "A semi-automatic indexing system based on embedded information in HTML documents." Library Hi Tech 33, no. 2 (2015): 195–210. http://dx.doi.org/10.1108/lht-12-2014-0114.

Full text

Abstract:

Purpose – The purpose of this paper is to describe and evaluate the tool DigiDoc MetaEdit which allows the semi-automatic indexing of HTML documents. The tool works by identifying and suggesting keywords from a thesaurus according to the embedded information in HTML documents. This enables the parameterization of keyword assignment based on how frequently the terms appear in the document, the relevance of their position, and the combination of both. Design/methodology/approach – In order to evaluate the efficiency of the indexing tool, the descriptors/keywords suggested by the indexing tool ar

APA, Harvard, Vancouver, ISO, and other styles

11

THIEMANN, PETER. "A typed representation for HTML and XML documents in Haskell." Journal of Functional Programming 12, no. 4-5 (2002): 435–68. http://dx.doi.org/10.1017/s0956796802004392.

Full text

Abstract:

We define a family of embedded domain specific languages for generating HTML and XML documents. Each language is implemented as a combinator library in Haskell. The generated HTML/XML documents are guaranteed to be well-formed. In addition, each library can guarantee that the generated documents are valid XML documents to a certain extent (for HTML only a weaker guarantee is possible). On top of the libraries, Haskell serves as a meta language to define parameterized documents, to map structured documents to HTML/XML, to define conditional content, or to define entire web sites. The combinator

APA, Harvard, Vancouver, ISO, and other styles

12

Wu, Qi, Xing-shu Chen, Kai Zhu, and Chun-hui Wang. "Relevance-based content extraction of HTML documents." Journal of Central South University 19, no. 7 (2012): 1921–26. http://dx.doi.org/10.1007/s11771-012-1226-8.

Full text

APA, Harvard, Vancouver, ISO, and other styles

13

Gupta, Shivangi, and Mukesh Rawat. "Keyword based Automatic Summarization of HTML Documents." International Journal of Computer Applications 127, no. 8 (2015): 24–29. http://dx.doi.org/10.5120/ijca2015906421.

Full text

APA, Harvard, Vancouver, ISO, and other styles

14

Devendra, Gautam, Verma Priyanshu, and Kumar Singh Rakesh. "DOC Merge WEB-Application." Research and Applications: Emerging Technologies 5, no. 1 (2023): 45–51. https://doi.org/10.5281/zenodo.7889206.

Full text

Abstract:

<em>Document editors are software applications that allow users to create, edit, and format text-based documents. These tools provide a wide range of features and capabilities, including font and paragraph formatting options, spell checking and grammar checking, support for collaboration and sharing, and the ability to insert images and other media. The use of document editors has become increasingly widespread in recent years, thanks in part to the rise of cloud-based computing and the availability of powerful mobile devices. This has made it easier for people to create and edit documents fro

APA, Harvard, Vancouver, ISO, and other styles

15

Jann, Ben. "Creating HTML or Markdown Documents from within Stata using Webdoc." Stata Journal: Promoting communications on statistics and Stata 17, no. 1 (2017): 3–38. http://dx.doi.org/10.1177/1536867x1701700102.

Full text

Abstract:

In this article, I discuss the use of webdoc for creating HTML or Markdown documents from within Stata. The webdoc command provides a way to embed HTML or Markdown code directly in a do-file and automate the integration of results from Stata in the final document. The command can be used, for example, to create a webpage documenting your data analysis, including all Stata output and graphs. More generally, the command can be used to create and maintain a website that contains results computed by Stata.

APA, Harvard, Vancouver, ISO, and other styles

16

Plch, Roman, and Petra Sarmanova. "Interactive 3D Graphics in HTML and PDF Documents." Zpravodaj Československého sdružení uživatelů TeXu 18, no. 1-2 (2008): 76–92. http://dx.doi.org/10.5300/2008-1-2/76.

Full text

APA, Harvard, Vancouver, ISO, and other styles

17

SHINZATO, KEIJI, and KENTARO TORISAWA. "Automatic acquisition of hyponymy relations from HTML documents." Journal of Natural Language Processing 12, no. 1 (2005): 125–50. http://dx.doi.org/10.5715/jnlp.12.125.

Full text

APA, Harvard, Vancouver, ISO, and other styles

18

Altarturi, Hamza H. M., Muntadher Saadoon, and Nor Badrul Anuar. "Web content topic modeling using LDA and HTML tags." PeerJ Computer Science 9 (July 11, 2023): e1459. http://dx.doi.org/10.7717/peerj-cs.1459.

Full text

Abstract:

An immense volume of digital documents exists online and offline with content that can offer useful information and insights. Utilizing topic modeling enhances the analysis and understanding of digital documents. Topic modeling discovers latent semantic structures or topics within a set of digital textual documents. The Internet of Things, Blockchain, recommender system, and search engine optimization applications use topic modeling to handle data mining tasks, such as classification and clustering. The usefulness of topic models depends on the quality of resulting term patterns and topics wit

APA, Harvard, Vancouver, ISO, and other styles

19

White, Jason. "Using Markup Languages for Accessible Scientific, Technical, and Scholarly Document Creation." Journal of Science Education for Students with Disabilities 25, no. 1 (2022): 1–22. http://dx.doi.org/10.14448/jsesd.14.0005.

Full text

Abstract:

In using software to write a scientific, technical, or other scholarly document, authors have essentially two options. They can either write it in a ‘what you see is what you get’ (WYSIWYG) editor such as a word processor, or write it in a text editor using a markup language such as HTML, LaTeX, Markdown, or AsciiDoc. This paper gives an overview of the latter approach, focusing on both the non-visual accessibility of the writing process, and that of the documents produced. Currently popular markup languages and established tools associated with them are introduced. Support for mathematical no

APA, Harvard, Vancouver, ISO, and other styles

20

Pau, Gregoire, and Wolfgang Huber. "The hwriter package: Composing HTML documents with R objects." R Journal 1, no. 1 (2009): 22. http://dx.doi.org/10.32614/rj-2009-009.

Full text

APA, Harvard, Vancouver, ISO, and other styles

21

Lim, Jong-Gyun. "Using Coollists to index HTML documents in the Web." Computer Networks and ISDN Systems 28, no. 1-2 (1995): 147–54. http://dx.doi.org/10.1016/0169-7552(95)00114-0.

Full text

APA, Harvard, Vancouver, ISO, and other styles

22

Ibragimov, T. F., and A. A. Ferenets. "Automatic Annotation of HTML Documents Using the Microdata Standard." Automatic Documentation and Mathematical Linguistics 58, S5 (2024): S283—S288. https://doi.org/10.3103/s0005105525700359.

Full text

APA, Harvard, Vancouver, ISO, and other styles

23

Ibragimov, Timur Ferdinandovich, and Alexander Andreevich Ferenets. "Automatic Annotation of HTML Documents using the Microdata Standard." Russian Digital Libraries Journal 27, no. 5 (2024): 730–44. https://doi.org/10.26907/1562-5419-2024-27-5-730-744.

Full text

Abstract:

The development of an application based on machine learning methods for automatic annotation of web pages according to the Microdata standard is described, with the possibility of extension to other standards and injecting data to JSX files. Datasets were collected and prepared for training Machine Learning (ML) models. The ML model metrics were collected and analyzed.

APA, Harvard, Vancouver, ISO, and other styles

24

Attardi, Giuseppe, Sergio Marco, and Davide Salvi. "Categorisation by Context." JUCS - Journal of Universal Computer Science 4, no. (9) (1998): 719–36. https://doi.org/10.3217/jucs-004-09-0719.

Full text

Abstract:

Assistance in retrieving of documents on the World Wide Web is provided either by search engines, through keyword based queries, or by catalogues, which organise documents into hierarchical collections. Maintaining catalogues manually is becoming increasingly difficult due to the sheer amount of material on the Web, and therefore it will be soon necessary to resort to techniques for automatic classification of documents. Classification is traditionally performed by extracting information for indexing a document from the document itself. The paper describes the technique of categorisation by co

APA, Harvard, Vancouver, ISO, and other styles

25

Umehara, Masayuki, Koji Iwanuma, and Hirokazu Nagai. "A Case-Based Semi-automatic Transformation from HTML Documents to XML Ones — Using the Similarity between HTML Documents Constituting a Series —." Transactions of the Japanese Society for Artificial Intelligence 16, no. 5 (2001): 408–16. http://dx.doi.org/10.1527/tjsai.16.408.

Full text

APA, Harvard, Vancouver, ISO, and other styles

26

Umehara, Masayuki, Koji Iwanuma, and Hidetomo Nabashima. "A Case-Based Recognition of Semantic Structures in HTML Documents Which Constitutes a Document Series." Transactions of the Japanese Society for Artificial Intelligence 17, no. 6 (2002): 690–98. http://dx.doi.org/10.1527/tjsai.17.690.

Full text

APA, Harvard, Vancouver, ISO, and other styles

27

Ashraf, F., T. Ozyer, and R. Alhajj. "Employing Clustering Techniques for Automatic Information Extraction From HTML Documents." IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews) 38, no. 5 (2008): 660–73. http://dx.doi.org/10.1109/tsmcc.2008.923882.

Full text

APA, Harvard, Vancouver, ISO, and other styles

28

Manabe, Tomohiro, and Keishi Tajima. "Extracting logical hierarchical structure of HTML documents based on headings." Proceedings of the VLDB Endowment 8, no. 12 (2015): 1606–17. http://dx.doi.org/10.14778/2824032.2824058.

Full text

APA, Harvard, Vancouver, ISO, and other styles

29

ZHANG, LIHUA, and YIU-KAI NG. "A Query Engine for Retrieving Information from Chinese HTML Documents." International Journal of Computer Processing of Languages 17, no. 03 (2004): 135–64. http://dx.doi.org/10.1142/s0219427904001085.

Full text

APA, Harvard, Vancouver, ISO, and other styles

30

Haghish, E. F. "Markdoc: Literate Programming in Stata." Stata Journal: Promoting communications on statistics and Stata 16, no. 4 (2016): 964–88. http://dx.doi.org/10.1177/1536867x1601600409.

Full text

Abstract:

Rigorous documentation of the analysis plan, procedure, and computer codes enhances the comprehensibility and transparency of data analysis. Documentation is particularly critical when the codes and data are meant to be publicly shared and examined by the scientific community to evaluate the analysis or adapt the results. The popular approach for documenting computer codes is known as literate programming, which requires preparing a trilingual script file that includes a programming language for running the data analysis, a human language for documentation, and a markup language for typesettin

APA, Harvard, Vancouver, ISO, and other styles

31

Andarwati, Hayu, R. Rizal Isnanto, and Ike Pertiwi Windasari. "Sistem Informasi Manajemen Surat pada Dinas Pendapatan, Pengelolaan Keuangan dan Aset Daerah Kabupaten Pati." Jurnal Teknologi dan Sistem Komputer 2, no. 3 (2014): 195–202. http://dx.doi.org/10.14710/jtsiskom.2.3.2014.195-202.

Full text

Abstract:

Abstract - Documents handling at Dinas Pendapatan, Pengelolaan Keuangan dan Aset Daerah Kabupaten Pati was done manually and was not computerized. Because of that, it is needed to build documents management information system to automate the document control activity at department include the function of documents recording, documents making, and documents tracking. For gaining the purpose, a research must be done. That research used Framework for Application System Thinking (FAST) with eight phase, that is scope definition, problem analyst, requirement analyst, logical design, decision analys

APA, Harvard, Vancouver, ISO, and other styles

32

WIELEMAKER, JAN, ZHISHENG HUANG, and LOURENS VAN DER MEIJ. "SWI-Prolog and the web." Theory and Practice of Logic Programming 8, no. 3 (2008): 363–92. http://dx.doi.org/10.1017/s1471068407003237.

Full text

Abstract:

AbstractProlog is an excellent tool for representing and manipulating data written in formal languages as well as natural language. Its safe semantics and automatic memory management make it a prime candidate for programming robust Web services. Although Prolog is commonly seen as a component in a Web application that is either embedded or communicates using a proprietary protocol, we propose an architecture where Prolog communicates to other components in a Web application using the standard HTTP protocol. By avoiding embedding in external Web servers, development and deployment become much e

APA, Harvard, Vancouver, ISO, and other styles

33

Mattheos, Nikos, Anders Nattestad, and Rolf Attström. "Local CD-ROM in interaction with HTML documents over the Internet." European Journal of Dental Education 4, no. 3 (2000): 124–27. http://dx.doi.org/10.1034/j.1600-0579.2000.040306.x.

Full text

APA, Harvard, Vancouver, ISO, and other styles

34

Prateek, Raman* Ravi Kant Gautam Ravi Yadav Manish Kumar Sharma. "BUILDING HIGHLY SPECIALISED WEB CRAWLER USING B-TREE SEARCH AND HTML PARSER." International Journal OF Engineering Sciences & Management Research 3, no. 5 (2016): 98–102. https://doi.org/10.5281/zenodo.51574.

Full text

Abstract:

Web crawlers are Internet bot that automatically traverse the hyper-link structure of the world wide web in order to locate and retrieve information. This paper describes a web crawling approach based on B-tree search and HTML Parser. As the goal of crawler is to selectively seek out pages that are relevant to given keywords. Rather than collecting and indexing all available web documents to be able to answer all possible queries, a crawler analyze its crawl boundary to hit upon the links that are likely to be most relevant for the crawl, and avoids irrelevant links of the document.  &nbs

APA, Harvard, Vancouver, ISO, and other styles

35

Samola, Jonathan, Edwin Tenda, Benny Pinontoan, and Eliasta Ketaren. "THE IMPLEMENTATION OF BLOCKCHAIN SYSTEM IN VEHICLE REGISTRATION CERTIFICATE (BPKB) DATA BASED ON A WEBSITE." Jurnal TIMES 13, no. 2 (2024): 275–85. https://doi.org/10.51351/jtm.13.2.2024761.

Full text

Abstract:

The Vehicle Registration Certificate (STNK) and the Vehicle Ownership Document (BPKB) are crucial documents for identifying and proving vehicle ownership. Without a BPKB, verifying the legitimacy of a vehicle becomes difficult, potentially rendering it illegal and increasing the risk of it being classified as stolen. The process of verifying a BPKB at the central office involves lengthy and time-consuming procedures, complicating the process. The lack of efficient methods for verifying vehicle documents can lead to the production of counterfeit documents by unauthorized parties. This research

APA, Harvard, Vancouver, ISO, and other styles

36

Pereira, R. A. Marques, A. Molinari, and G. Pasi. "Contextual weighted representations and indexing models for the retrieval of HTML documents." Soft Computing 9, no. 7 (2004): 481–92. http://dx.doi.org/10.1007/s00500-004-0361-z.

Full text

APA, Harvard, Vancouver, ISO, and other styles

37

CABEZA, DANIEL, and MANUEL HERMENEGILDO. "Distributed WWW programming using (Ciao-)Prolog and the PiLLoW library." Theory and Practice of Logic Programming 1, no. 3 (2001): 251–82. http://dx.doi.org/10.1017/s147106840100117x.

Full text

Abstract:

We discuss from a practical point of view a number of issues involved in writing distributed Internet and WWW applications using LP/CLP systems. We describe PiLLoW, a public-domain Internet and WWW programming library for LP/CLP systems that we have designed to simplify the process of writing such applications. PiLLoW provides facilities for accessing documents and code on the WWW; parsing, manipulating and generating HTML and XML structured documents and data; producing HTML forms; writing form handlers and CGI-scripts; and processing HTML/XML templates. An important contribution of PiLLoW is

APA, Harvard, Vancouver, ISO, and other styles

38

Rajagopal, Prabha, Sri Devi Ravana, Yun Sing Koh, and Vimala Balakrishnan. "Evaluating the effectiveness of information retrieval systems using effort-based relevance judgment." Aslib Journal of Information Management 71, no. 1 (2019): 2–17. http://dx.doi.org/10.1108/ajim-04-2018-0086.

Full text

Abstract:

Purpose The effort in addition to relevance is a major factor for satisfaction and utility of the document to the actual user. The purpose of this paper is to propose a method in generating relevance judgments that incorporate effort without human judges’ involvement. Then the study determines the variation in system rankings due to low effort relevance judgment in evaluating retrieval systems at different depth of evaluation. Design/methodology/approach Effort-based relevance judgments are generated using a proposed boxplot approach for simple document features, HTML features and readability

APA, Harvard, Vancouver, ISO, and other styles

39

Al-Dallal, Ammar, and Rasha S. Abdul-Wahab. "GA on IR." International Journal of Artificial Life Research 3, no. 2 (2012): 1–14. http://dx.doi.org/10.4018/jalr.2012040101.

Full text

Abstract:

Increasing the growth rates of websites’ number has led to the challenge of assisting Web customers in finding appropriate details from the Internet using an intelligent search engine. Information retrieval (IR) is an essential and useful strategy for Web users; thus, different strategies and techniques are designed for such purpose. Currently, the focus on the usefulness of Artificial Intelligence (AI) has been improved with IR. One AI area is Evolutionary Computation (EC), which is based on designs of natural selection. A traditional and important strategy in EC is Genetic Algorithm (GA); th

APA, Harvard, Vancouver, ISO, and other styles

40

Goto, Kento, Ryosuke Koshijima, and Motomichi Toyama. "Responsive HTML generation using SuperSQL." International Journal of Web Information Systems 13, no. 3 (2017): 324–51. http://dx.doi.org/10.1108/ijwis-04-2017-0032.

Full text

Abstract:

Purpose With the rapid spread of smartphones and tablets, it is becoming necessary for web developers to create responsive web pages which are visually appealing on devices of various sizes. However, building responsive UIs is a very challenging task, requiring deep knowledge of HTML and CSS. This paper aims to propose an approach to generate responsive web pages using SuperSQL, which is an extension of SQL that can format data retrieved from a database into various kinds of structured documents. Design/methodology/approach By incorporating the methodology of bootstrap, a grid-based framework

APA, Harvard, Vancouver, ISO, and other styles

41

BRZEMINSKI, PAWEL, and WITOLD PEDRYCZ. "TEXTUAL-BASED CLUSTERING OF WEB DOCUMENTS." International Journal of Uncertainty, Fuzziness and Knowledge-Based Systems 12, no. 06 (2004): 715–43. http://dx.doi.org/10.1142/s021848850400317x.

Full text

Abstract:

In our study we presented an effective method for clustering of Web pages. From flat HTML files we extracted keywords, formed feature vectors as representation of Web pages and applied them to a clustering method. We took advantage of the Fuzzy C-Means clustering algorithm (FCM). We demonstrated an organized and schematic manner of data collection. Various categories of Web pages were retrieved from ODP (Open Directory Project) in order to create our datasets. The results of clustering proved that the method performs well for all datasets. Finally, we presented a comprehensive experimental stu

APA, Harvard, Vancouver, ISO, and other styles

42

Adefowoke Ojokoh, Bolanle, Olumide Sunday Adewale, and Samuel Oluwole Falaki. "Automated document metadata extraction." Journal of Information Science 35, no. 5 (2009): 563–70. http://dx.doi.org/10.1177/0165551509105195.

Full text

Abstract:

Web documents are available in various forms, most of which do not carry additional semantics. This paper presents a model for general document metadata extraction. The model, which combines segmentation by keywords and pattern matching techniques, was implemented using PHP, MySQL, JavaScript and HTML. The system was tested with 40 randomly selected PDF documents (mainly theses). An evaluation of the system was done using standard criteria measures namely precision, recall, accuracy and F-measure. The results show that the model is relatively effective for the task of metadata extraction, espe

APA, Harvard, Vancouver, ISO, and other styles

43

Peroni, Silvio, Francesco Osborne, Angelo Di Iorio, et al. "Research Articles in Simplified HTML: a Web-first format for HTML-based scholarly articles." PeerJ Computer Science 3 (October 2, 2017): e132. http://dx.doi.org/10.7717/peerj-cs.132.

Full text

Abstract:

PurposeThis paper introduces the Research Articles in Simplified HTML (or RASH), which is a Web-first format for writing HTML-based scholarly papers; it is accompanied by the RASH Framework, a set of tools for interacting with RASH-based articles. The paper also presents an evaluation that involved authors and reviewers of RASH articles submitted to the SAVE-SD 2015 and SAVE-SD 2016 workshops.DesignRASH has been developed aiming to: be easy to learn and use; share scholarly documents (and embedded semantic annotations) through the Web; support its adoption within the existing publishing workfl

APA, Harvard, Vancouver, ISO, and other styles

44

HaCohen-Kerner, Yaakov, Ittay Stern, David Korkus, and Erick Fredj. "AUTOMATIC MACHINE LEARNING OF KEYPHRASE EXTRACTION FROM SHORT HTML DOCUMENTS WRITTEN IN HEBREW." Cybernetics and Systems 38, no. 1 (2007): 1–21. http://dx.doi.org/10.1080/01969720600998546.

Full text

APA, Harvard, Vancouver, ISO, and other styles

45

Sanka, Anoop, Shravan Chamakura, and Sharma Chakravarthy. "A dataflow approach to efficient change detection of HTML/XML documents in WebVigiL." Computer Networks 50, no. 10 (2006): 1547–63. http://dx.doi.org/10.1016/j.comnet.2005.10.016.

Full text

APA, Harvard, Vancouver, ISO, and other styles

46

Zhang, Xiaoming, Pengtao Lv, Chongchong Zhao, and Jianxian Wang. "A Method for Materials Knowledge Extraction from HTML Tables Based on Sibling Comparison." International Journal of Software Engineering and Knowledge Engineering 26, no. 06 (2016): 897–926. http://dx.doi.org/10.1142/s0218194016500303.

Full text

Abstract:

There are rich data resources residing in available materials websites, and most of these data resources are shown in the form of HTML tables. However, it is difficult to distinguish the attributes and values because of the semi-structured feature of HTML tables. Therefore, identifying attributes in HTML tables is the key issue for the information acquisition. In this paper, based on sibling comparison, a method for materials knowledge extraction from HTML tables is proposed, which consists of three steps: acquiring sibling tables, identifying table pattern and extracting table data. We show h

APA, Harvard, Vancouver, ISO, and other styles

47

Apoorva, Ganapathy, Vadlamudi Siddhartha, Al Ayub Ahmed Alim, Shakawat Hossain Md., and Aminul Islam Md. "HTML Content and Cascading Tree Sheets: Overview of Improving Web Content Visualization." Turkish Online Journal of Qualitative Inquiry 12, no. 3 (2021): 2428–38. https://doi.org/10.5281/zenodo.5522159.

Full text

Abstract:

The database system has contributed immensely to the present state of web pages and caching. Also, there are so many layers in broad-spectrum for catching numerous web content with culmination with extension .html, XML, .json, .txt, .jpg, .pdf, .png, .gif among others. Examples include web servers, content delivery networks, etc. But, these commonalities of presentational HTML cannot separate HTML content, which enables HTML documents harder to create, reuse, maintain, and tailor. Hence the objective of this study was aimed at cascading cache layer in a content management system using cascadin

APA, Harvard, Vancouver, ISO, and other styles

48

M, ABISORNAM. "Document Verification System." INTERNATIONAL JOURNAL OF SCIENTIFIC RESEARCH IN ENGINEERING AND MANAGEMENT 09, no. 05 (2025): 1–9. https://doi.org/10.55041/ijsrem46911.

Full text

Abstract:

ABSTRACT The Document Verification System is a web-based solution developed to streamline and enhance the process of verifying official documents with accuracy and efficiency. Built using modern web technologies including HTML, CSS, JavaScript, Node.js and integrated with Optical Character Recognition (OCR) capabilities, the system aims to automate and simplify document validation. Users can upload scanned copies or images of documents such as ID proofs, academic certificates, or business records. The system utilizes OCR to extract text content from the uploaded files and cross-verifies the ex

APA, Harvard, Vancouver, ISO, and other styles

49

Mhawi, Doaa N., Haider W. Oleiwi, Nagham H. Saeed, and Heba L. Al-Taie. "An Efficient Information Retrieval System Using Evolutionary Algorithms." Network 2, no. 4 (2022): 583–605. http://dx.doi.org/10.3390/network2040034.

Full text

Abstract:

When it comes to web search, information retrieval (IR) represents a critical technique as web pages have been increasingly growing. However, web users face major problems; unrelated user query retrieved documents (i.e., low precision), a lack of relevant document retrieval (i.e., low recall), acceptable retrieval time, and minimum storage space. This paper proposed a novel advanced document-indexing method (ADIM) with an integrated evolutionary algorithm. The proposed IRS includes three main stages; the first stage (i.e., the advanced documents indexing method) is preprocessing, which consist

APA, Harvard, Vancouver, ISO, and other styles

50

DEL ROSARIO, Marco Jr, and Julius SARENO. "Theses and Capstone Projects Plagiarism Checker using Kolmogorov Complexity Algorithm." Walailak Journal of Science and Technology (WJST) 17, no. 7 (2020): 726–44. http://dx.doi.org/10.48048/wjst.2020.6498.

Full text

Abstract:

In education, students attempt to copy previous works and are relying on prepared solutions available on the Internet in order to meet their requirements. This action leads to plagiarism, which is becoming part of educational institutions’ concern to reduce growing academic dishonesty. With regards to the aforementioned issue, this study aims to design and develop a plagiarism checker capable of registering documents, granting access to users, and calculating the similarity between documents. Thus, the software was constructed using HTML, PHP, JavaScript, CSS, and MySQL. The developed system i

APA, Harvard, Vancouver, ISO, and other styles

We offer discounts on all premium plans for authors whose works are included in thematic literature selections. Contact us to get a unique promo code!