hkr.sePublications
Planned maintenance
A system upgrade is planned for 10/12-2024, at 12:00-13:00. During this time DiVA will be unavailable.
Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Optimization for search engines based on external revision database
Kristianstad University, Faculty of Natural Science.
Kristianstad University, Faculty of Natural Science.
2020 (English)Independent thesis Basic level (degree of Bachelor), 10 credits / 15 HE creditsStudent thesis
Abstract [en]

The amount of data is continually growing and the ability to efficiently search through vast amounts of data is almost always sought after. To efficiently find data in a set there exist many technologies and methods but all of them cost in the form of resources like cpu-cycles, memory and storage. In this study a search engine (SE) is optimized using several methods and techniques. Thesis looks into how to optimize a SE that is based on an external revision database.The optimized implementation is compared to a non-optimized implementation when executing a query. An artificial neural network (ANN) trained on a dataset containing 3 years normal usage at a company is used to prioritize within the resultset before returning the result to the caller. The new indexing algorithms have improved the document space complexity by removing all duplicate documents that add no value. Machine learning (ML) has been used to analyze the user behaviour to reduce the necessary amount of documents that gets retrieved by a query.

Place, publisher, year, edition, pages
2020. , p. 30
Keywords [en]
Search engine, optimization, revision database, machine learning, retrieval models, complexity
National Category
Computer Sciences
Identifiers
URN: urn:nbn:se:hkr:diva-21000OAI: oai:DiVA.org:hkr-21000DiVA, id: diva2:1459914
External cooperation
Infrasight Labs
Educational program
Bachelor programme in Computer Software Development
Presentation
2020-06-02, Online, Kristianstad, 10:28 (English)
Supervisors
Examiners
Available from: 2020-08-24 Created: 2020-08-21 Last updated: 2020-08-24Bibliographically approved

Open Access in DiVA

fulltext(806 kB)246 downloads
File information
File name FULLTEXT01.pdfFile size 806 kBChecksum SHA-512
a5ff0b0266f124c3feb4c8bddfbfc0a1b78cc10b1b384508f393f80abf63c98235bffa84dfaf03cee7797e17f76997c5ded9f0975ceba2bc35797729c6de434c
Type fulltextMimetype application/pdf

Search in DiVA

By author/editor
Lemón Larsson, Fredrik
By organisation
Faculty of Natural Science
Computer Sciences

Search outside of DiVA

GoogleGoogle Scholar
Total: 246 downloads
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

urn-nbn

Altmetric score

urn-nbn
Total: 501 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf