
		<paper>
			<loc>https://jjcit.org/paper/169</loc>
			<title>A NOVEL TRUE-REAL-TIME SPATIOTEMPORAL DATA STREAM PROCESSING FRAMEWORK</title>
			<doi>10.5455/jjcit.71-1646838830</doi>
			<authors>Ature Angbera,Huah Yong Chan</authors>
			<keywords>Spatiotemporal  big  data,Real-time  processing,Stream  processing,Apache  Kafka,Apache  Flink,Apache Cassandra,Apache Spark</keywords>
			<citation>5</citation>
			<views>6576</views>
			<downloads>1578</downloads>
			<received_date>9-Mar.-2022</received_date>
			<revised_date>  2-May-2022</revised_date>
			<accepted_date>  23-May-2022</accepted_date>
			<abstract>The  ability  to  interpret  spatiotemporal  data  streams  in  real time  is  critical  for  a  range  of  systems.  However, 
processing  vast  amounts  of  spatiotemporal  data  out  of several sources,  such  as  online  traffic,  social  platforms, 
sensor  networks and other  sources,  is  a  considerable  challenge.  The  major  goal  of  this  study  is  to  create a 
framework  for  processing  and analyzing spatiotemporal  data  from  multiple  sources  with  irregular  shapes, so 
that researchers can focus on data analysis instead of worrying about the data sources' structure. We introduced 
a novel spatiotemporal data paradigm for true-real-time stream processing, which enables high-speed and low-
latency real-time  data processing, with these considerations in mind. A comparison of two state-of-the-art real-
time process architectures was offered, as well as a full review of the various open-source technologies for real-
time  data  stream  processing and their  system  topologies were also presented.  Hence,  this  study  proposed  a 
brand-new  framework  that  integrates  Apache  Kafka  for  spatiotemporal  data  ingestion,  Apache Flink for  true- 
real-time  processing  of  spatiotemporal  stream  data,  as  well  as  machine  learning  for  real-time  predictions and 
Apache  Cassandra  at  the  storage  layer  for  distributed  storage  in  real time.  The  proposed  framework  was 
compared  with  others from  the  literature using  the  following  features:  Scalability  (Sc),  prediction  tools  (PT), 
data analytics (DA), multiple  event types (MET), data storage  (DS), Real-time  (Rt) and performance evaluation 
(PE) stream processing (SP) and our proposed framework provided the ability to handle all of these tasks.</abstract>
		</paper>


