Repositorio Institucional
Repositorio Institucional
CONICET Digital
  • Inicio
  • EXPLORAR
    • AUTORES
    • DISCIPLINAS
    • COMUNIDADES
  • Estadísticas
  • Novedades
    • Noticias
    • Boletines
  • Ayuda
    • General
    • Datos de investigación
  • Acerca de
    • CONICET Digital
    • Equipo
    • Red Federal
  • Contacto
JavaScript is disabled for your browser. Some features of this site may not work without it.
  • INFORMACIÓN GENERAL
  • RESUMEN
  • ESTADISTICAS
 
Artículo

Active learning approach to label network traffic datasets

Guerra Torres, Jorge LuisIcon ; Catania, Carlos AdrianIcon ; Veas, Eduardo
Fecha de publicación: 12/2019
Editorial: Elsevier
Revista: Journal of Information Security and Applications
ISSN: 2214-2126
Idioma: Inglés
Tipo de recurso: Artículo publicado
Clasificación temática:
Otras Ingenierías y Tecnologías

Resumen

In the field of network security, the process of labeling a network traffic dataset is specially expensive since expert knowledge is required to perform the annotations. With the aid of visual analytic applications such as RiskID, the effort of labeling network traffic is considerable reduced. However, since the label assignment still requires an expert pondering several factors, the annotation process remains a difficult task. The present article introduces a novel active learning strategy for building a random forest model based on user previously-labeled connections. The resulting model provides to the user an estimation of the probability of the remaining unlabeled connections helping him in the traffic annotation task. The article describes the active learning strategy, the interfaces with the RiskID system, the algorithms used to predict botnet behavior, and a proposed evaluation framework. The evaluation framework includes studies to assess not only the prediction performance of the active learning strategy but also the learning rate and resilience against noise as well as the improvements on other well known labeling strategies. The framework represents a complete methodology for evaluating the performance of any active learning solution. The evaluation results showed proposed approach is a significant improvement over previous labeling strategies.
Palabras clave: ACTIVE LEARNING , LABELING NETWORK , LEARNING RATE , NOISE ROBUSTNESS , RANDOM FOREST
Ver el registro completo
 
Archivos asociados
Tamaño: 1.766Mb
Formato: PDF
.
Solicitar
Licencia
info:eu-repo/semantics/restrictedAccess Excepto donde se diga explícitamente, este item se publica bajo la siguiente descripción: Creative Commons Attribution-NonCommercial-ShareAlike 2.5 Unported (CC BY-NC-SA 2.5)
Identificadores
URI: http://hdl.handle.net/11336/135852
DOI: https://doi.org/10.1016/j.jisa.2019.102388
URL: https://www.sciencedirect.com/science/article/abs/pii/S2214212618304344
Colecciones
Articulos(CCT - MENDOZA)
Articulos de CTRO.CIENTIFICO TECNOL.CONICET - MENDOZA
Citación
Guerra Torres, Jorge Luis; Catania, Carlos Adrian; Veas, Eduardo; Active learning approach to label network traffic datasets; Elsevier; Journal of Information Security and Applications; 49; 12-2019; 1-13
Compartir
Altmétricas
 

Enviar por e-mail
Separar cada destinatario (hasta 5) con punto y coma.
  • Facebook
  • X Conicet Digital
  • Instagram
  • YouTube
  • Sound Cloud
  • LinkedIn

Los contenidos del CONICET están licenciados bajo Creative Commons Reconocimiento 2.5 Argentina License

https://www.conicet.gov.ar/ - CONICET

Inicio

Explorar

  • Autores
  • Disciplinas
  • Comunidades

Estadísticas

Novedades

  • Noticias
  • Boletines

Ayuda

Acerca de

  • CONICET Digital
  • Equipo
  • Red Federal

Contacto

Godoy Cruz 2290 (C1425FQB) CABA – República Argentina – Tel: +5411 4899-5400 repositorio@conicet.gov.ar
TÉRMINOS Y CONDICIONES