Website Category Classification Using Fine-Tuned Bert Language Model

Demirkıran, Ferhat; Dağ, Hasan; Çayır, Aykut; Ünal, Uğur

Website Category Classification Using Fine-Tuned Bert Language Model

Date

2020

Authors

Demirkıran, Ferhat

Dağ, Hasan

Çayır, Aykut

Demirkıran, Ferhat

Ünal, Uğur

Dağ, Hasan

Publisher

Institute of Electrical and Electronics Engineers Inc.

Organizational Units

Organizational Unit

Management Information Systems

Abstract

The contents on the Word Wide Web is expanding every second providing web users a rich content. However, this situation may cause web users harm rather than good due to its harmful or misleading information. The harmful contents can contain text, audio, video, or image that can be about violence, adult contents, or any other harmful information. Especially young people may readily be affected with these harmful information psychologically. To prevent youth from these harmful contents, various web filtering techniques, such as keyword filtering, Uniform Resource Locator (URL) based filtering, Intelligent analysis, and semantic analysis, are used. We propose an algorithm that can classify websites, which may contain adult contents, with 67.81% (BERT) accuracy among 32 unique categories. We also show that a BERT model gives higher accuracy than both the Sequential and Functional API models when used for text classification.

Keywords

BERT, Functional API, Sequential API, Text classification, Web filtering

Start Page

333

End Page

336

URI

https://hdl.handle.net/20.500.12469/3562
https://doi.org/10.1109/UBMK50275.2020.9219384

Collections

Scopus İndeksli Yayınlar Koleksiyonu
WoS İndeksli Yayınlar Koleksiyonu
Yönetim Bilişim Sistemleri Bölümü Koleksiyonu

Full item page

Website Category Classification Using Fine-Tuned Bert Language Model

Date

Authors

Journal Title

Journal ISSN

Volume Title

Publisher

Open Access Color

OpenAIRE Downloads

OpenAIRE Views

Research Projects

Organizational Units

Journal Issue

Events

Abstract

Description

Keywords

Turkish CoHE Thesis Center URL

Fields of Science

Citation

WoS Q

Scopus Q

Source

Volume

Issue

Start Page

End Page

URI

Collections