Split inference (SI) has been devised as a valuable solution to enable the execution of computation-heavy deep neural network (DNN) inference models on resource-constrained edge devices. The different layers of a DNN model are distributed to one or several nodes (mainly edge/cloud servers) cooperating with the end-device requesting the inference. Boosted by the sixth generation (6 G) trends, programmable network nodes equipped with computing, caching and intelligence capabilities can be involved in such a cooperative task. In this work, we propose Named Data Networking (NDN) as a key enabler of in-network SI. Its connectionless communication model coupled with routing-by-name and native in-network caching capabilities can facilitate dynamic splitting operations on nodes throughout the cloud-to-things continuum. We show how NDN design principles and communication primitives can be leveraged to support in-network SI. Then, preliminary results are reported to showcase the benefits of the conceived proposal.

In-Network Edge Split Inference via Named Data Networking

Amadeo, Marica
Primo
;
Molinaro, Antonella;
2025-01-01

Abstract

Split inference (SI) has been devised as a valuable solution to enable the execution of computation-heavy deep neural network (DNN) inference models on resource-constrained edge devices. The different layers of a DNN model are distributed to one or several nodes (mainly edge/cloud servers) cooperating with the end-device requesting the inference. Boosted by the sixth generation (6 G) trends, programmable network nodes equipped with computing, caching and intelligence capabilities can be involved in such a cooperative task. In this work, we propose Named Data Networking (NDN) as a key enabler of in-network SI. Its connectionless communication model coupled with routing-by-name and native in-network caching capabilities can facilitate dynamic splitting operations on nodes throughout the cloud-to-things continuum. We show how NDN design principles and communication primitives can be leveraged to support in-network SI. Then, preliminary results are reported to showcase the benefits of the conceived proposal.
File in questo prodotto:
Non ci sono file associati a questo prodotto.
Pubblicazioni consigliate

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11570/3342269
 Attenzione

Attenzione! I dati visualizzati non sono stati sottoposti a validazione da parte dell'ateneo

Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 0
  • ???jsp.display-item.citation.isi??? 0
social impact