Advances in computational power and parallel processing abilities, as well as the availability of the massive amount of data, created considerable interest in deep learning (DL) in many fields of wireless communications research . The proliferation of smart devices also enabled a wide range of location-based services that relies on DL-based Indoor Positioning Systems (IPS) utilizing sensory data gathered from the smart devices . These systems require an immense amount of data for training. The data gathering process is time-consuming and strenuous. Data crowdsourcing has recently gained attention to alleviate the difficulty of data gathering by allowing public users to contribute to datasets in a participatory sensing manner.
Crowdsourcing is an essential solution for creating large datasets for DL for IPS, which is also used in various applications such as image recognition, medical research, and market research. It is used to reduce the required effort for implementing DL-based IPS . The traditional crowdsourcing technique relies on the users who send their data directly to the centralized server, which violates users’ privacy. There are methods for handling privacy-related problems such as data anonymization, encrypted evaluation, and noise injection. However, methods such as encrypted evaluation and anonymization are challenging to implement in real-world applications due to computational complexity. Additionally, methods similar to noise injection may cause long term degradation in the performance of DL-based applications. Recently, Federated Learning is an emerging method that enables privacy-preserving deep learning while keeping the benefits of crowdsourcing .
Federated learning has gathered a tremendous interest recently due to its privacy-preserving nature as well as the efficient use of resources by utilizing the processing power of edge devices . The local models are trained with user data at the device, and only models which are smaller in size are shared with the centralized server . Hence, federated learning is also communication-wise efficient since it avoids the transmission of vast amounts of data .
In this paper, we propose a novel method utilizing federated learning to create an RSS fingerprint-based localization crowdsourcing system, as illustrated in Fig. 1. Our main aim is creating a localization system employing crowdsourcing without violating user privacy. We designed a multilayer perceptron (MLP) as a global training model to extract location information from RSS features. We utilized TensorFlow and Keras libraries for simulating our federated learning approach using the UJIndoorLoc database . Numerical results show that our proposed
Fig. 1: Federated learning-based crowdsourcing system.
method is a viable solution for crowdsourcing for RSS fingerprint-based localization systems.
Compared to existing DL-based indoor localization systems, we list our main contributions as below:
• The main contribution of this work is enabling a privacy-preserving crowdsourcing technique
• Thirdly, we test our proposed method using experimental data captured in real-world settings
• Finally, we demonstrate that federated learning can perform localization accuracy (up to
This paper is structured as follows. In Section II, a brief overview of existing literature on traditional machine learning-based fingerprint localization techniques are presented. We explain our proposed federated learning for RSS fingerprint-based localization technique in Section III. Subsequently, we analyze the numerical results for three comprehensive scenarios in Section IV. Finally, we present our concluding remarks in Section V.
Fingerprinting-based localization is highly popular due to ease of implementation and high accuracy. There are several traditional fingerprinting applications, such as , where the system employs a probabilistic approach utilizing the RSS histogram of the access points (APs) for location clustering. Another well-known technique is a statistical approach based on k Nearest Neighbors (kNN). In this approach, the differences in the RSS values with various distance metrics for corresponding reference points (RP) in the database. The RP with the smallest difference for the distance metric is estimated as the location of the user .
Localization research utilizing deep learning has gained momentum with the availability of vast amounts of data incorporating location-related information gathered from smart devices as well as an increase in computational power. The authors propose a method called DeepFi based on DL using channel state information (CSI) fingerprint as input in , and they compared their method and shown excellence to traditional methods such as HORUS  and maximum likelihood . The proposed DeepFi method outperformed several traditional RSS and CSI based schemes in two representative indoor environments. The authors subsequently increased the accuracy utilizing calibrated phase information extracted from CSI  as the fingerprint. In another study, a Convolutional Neural Network (CNN) model is employed utilizing the angle of arrival information . However, each of these techniques requires hardware modifications on user devices and gathering sensitive user data for centralized training.
All of the traditional localization techniques are based on centralized training models, and they require the transmission of sensitive data from users, jeopardizing user privacy. However, the increasing awareness about privacy and the policies for rules and regulations from legislative entities are restricting the reach of those methods to the sensitive data. Hence a solution, such as federated learning, is not only viable but also required for future localization systems.
Federated learning is an emerging solution for privacy-preserving distributed learning  since it is an answer to several shortcomings of traditional machine learning algorithms. Federated learning allows multiple users to train their models locally without revealing their sensitive data to the centralized unit. It is communication-wise efficient since models are significantly smaller than the raw data in most of the applications . Federated learning is proven to be robust against unbalanced and non-IID data. It is applicable to a variety of deep learning methods such as MLP, CNN, and Long-Short-Term-Memory Recurrent Neural Networks. In our proposed method, we
Fig. 2: MultiLayer Perceptron global model.
use MLP architecture with federated learning for the RSS Fingperint-based localization system to overcome problems of traditional deep learning-based localization systems.
A. Data Preprocessing
In this paper, we consider a privacy-preserving federated learning-based crowdsourcing system for RSS Fingerprint-based localization for an indoor scenario, as seen in Fig.1. The initial global MLP model is generated based on the training data gathered for the centralized server to determine the number of inputs k, which is the number of fixed APs in the building, as in Fig. 2. Initially, a set of m training samples of RSS measurements are collected from a set of k fixed APs, broadcasting WiFi beacons in the building, and a set of n users gathering RSS measurements from received beacons. The labeling of ith sample with the location information
is done in the cartesian coordinates of the RSS measurement and represented as
Each training sample RSS values from a subset APs of k APs (i.e., not all APs are in the coverage of the user at the same time. The RSS value of beacon from jth AP for ith training sample is represented with . Hence the measurement vector for ith sample
can be defined as
where M is the set of all training samples, and is the subset of APs with an RSS value in the ith training sample.
Thus, the initial MLP model has k nodes in the input layer, which involves the vector of RSS values from WiFi beacons. In case the beacon from any AP is not available (that AP is not in the coverage), the RSS value of the corresponding AP is set to a constant value (C) as a missing value. The constant for missing value is set to a lower value than the minimum RSS possible, which represents it is not in the coverage. In our simulations, we used to represent missing RSS values. Hence the below equation represents the input vector for MLP model for
a sample i
B. Global MLP Model and Centralized Training
The global model has a defined amount number of hidden nodes in specified numbers of hidden (dense) layers. After the generation of the global model, it is trained with existing m samples. The MLP training can be shown as a minimization problem considering loss function
f as below
where w is the model weights, m is the number of training samples, th input training sample, and is the label for ith sample. The global MLP model is trained in the centralized server. The following step is transmitting the global MLP model to the users for federated learning, as described in the next subsection.
C. Federated Learning
The general procedure for federated learning is summarized in Fig. 3. The general model is transmitted to the users by the centralized server. In each round, n users gather training data (RSS measurements) from APs in the vicinity and label them with corresponding measurement locations in different areas of the building. These users may change their positions; hence each measurement may consist beacons from different APs nearby at each sample. Subsequently, the users train the general model with their data, considering the minimization problem given in (5). Each user conveys its local model and its number of training samples to the centralized server.
Fig. 3: Federated Learning procedure.
The centralized server applies the federated weighted averaging and updates the global model
based on local models of users as below
where H is the total number of training samples, t is the round. After federated averaging, the global model is updated and transmitted back to the edge devices. The edge devices retrain the global model with their local data. These steps are repeated until convergence.
In this section, the performance of the proposed crowdsourcing method is benchmarked to a centralized DL method in various settings and scenarios. Publicly available RSS data from UJIIndoorLoc database  is used to evaluate our proposed federated learning-based localization system. The UJIIndoorLoc database includes 19937 samples distributed over 390 meters in length to 270 meters in width area over four floors for training and testing. It has RSS values from 520 distinct APs gathered labeled with timestamp and location information in longitude and latitude.
The simulation parameters for federated learning are summarized in Table I. TensorFlow and Keras libraries are utilized to implement the proposed method. The optimizer is selected as Adam . The learning rate and optimizer parameters are found with heuristic search and are as shown in Table I. The MLP layers are created as 20x10x10x10 due to the processing power
TABLE I: Federated Learning system parameters.
and energy limitations of edge devices. More hidden layers with a larger number of hidden nodes would allow better training and feature extraction of RSS measurements. However, it would be power consuming and computationally overloading for the edge devices. The use of federated learning with this MLP formation would not bear a significant computational or energy burden on today’s powerful, Neural Processing Unit-integrated chipsets such as Snapdragon 845 or Kirin 970 . Batch size and the number of epochs for MLP learning are set to 100 and 10 per round. Three generic scenarios are simulated with UJIIndoorLoc database  to prove the merits of our methodology. The test dataset has 4000 samples in all scenarios uniformly distributed over the building, assumed to represent a real use case.
1) Scenario-1: As discussed in III-C, the user data is not transmitted in federated learning,
instead the local models are trained. In this scenario, we show the prominence of using federated learning for crowdsourcing of RSS fingerprint for localization. The centralized server has 3000 training samples distributed uniformly over the building in its dataset. The centralized training utilizes the centralized database due to the privacy concerns of users. In the federated learning, each user provides its local model trained by their 500 training samples in each round to the centralized server. The centralized server applies federated averaging as in (6), including its centralized model. The localization accuracy increases with additional users due to the utilization
TABLE II: MAE (m) at round t=100 for various scenarios.
of local datasets through federated learning.
2) Scenario-2: The performance of the proposed method is benchmarked against the centralized
training method in this scenario. We divide the uniformly distributed dataset into 1000 training samples per user, and the federated learning is applied as discussed in Section III-C. Separately, the centralized server has the same user data with federated learning and combined them at its database for centralized training. Hence, the performance of federated learning and centralized learning is compared with the same dataset.
3) Scenario-3: The functionality of the proposed method is tested against both location and
training size heterogeneity, similar to real-world applications. In this realistic scenario, instead of the same amount of data per user as tested in Scenario-2, the amount of data per user is a random variable with a Gaussian distribution with 1000 samples as a mean and 500 as standard deviation. Additionally, these samples are spatially divided; thus, each user covers a different exclusive region of the building. Each user trains its local model for a different region, which makes the convergence time of the global model longer. Hence, there are two realistic challenges against federated learning; user data size heterogeneity and sample location heterogeneity.
A summary of simulation results are provided in Table II. We used the Mean absolute error (MAE) to benchmark the performance of our proposed method. MAE is an essential metric for localization systems . Federated learning could achieve MAE below 5 meters in simulations, which is very reasonable considering the dimensions of the building (390 meters x 270 meters) that is used for creating the dataset . The necessity of using federated learning is presented in
Fig. 4: Effect of number of users and various heterogeneity scenarios on accuracy.
Scenario-1, where federated learning is used as a booster to the centralized learning. It has shown that the accuracy is improved significantly with the contributions of local models while keeping user privacy. The difference between centralized learning and federated learning is always lower than 1 meter for Scenario-2, and just above 1 meter for Scenario-3 due to its heterogeneous nature. Federated learning achieves better results with the increasing number of users, and its performance gets closer to centralized learning.
The effect of the number of users on federated learning localization accuracy is shown in Fig. 4.
As explained in Scenario-1, using federated learning instead of relying upon only a centralized database increases localization accuracy remarkably. Even using a single additional contributor in federated setting drops the localization MAE by 0.65 meters, as shown in Fig. 4(a). Additional crowdsourcing participants notably decrease the test localization MAE. The test localization MAE decreases by 0.9, 1.23, and 1.8 meters for n = 5, 10, and 15, respectively. It is shown that the training error of the centralized learning is (2.43 meters) is relatively lower compared to the training error of federated method (3.09 meters) for five users in the system (n = 5) for Scenario-1. The difference between training and test results is highest in this scenario due to the smaller number of samples. Note that the training and the test accuracy of the centralized method are constant due to using the same dataset since, in this scenario, users reject to provide their local data due to privacy concerns.
We provide a more realistic benchmark for the performance of federated learning in Fig. 4(b), considering Scenario-2. In this scenario, we compare the performance of federated learning and centralized learning with the same sets of data. The users waive their privacy on the data in case of centralized learning and transmit their collected data to the centralized server. The training is done with all the user data. In the case of federated learning, the users keep their privacy by training the model locally and keeping the data where it is generated. The performance of the localization accuracy difference is minimal, considering the gain in the privacy of users. The difference between centralized learning and federated learning is always below 1 meter.
Scenario-3 has the most challenging task for the proposed federated learning-based method, as shown in Fig.4(c). The MAE is 5.53 meters for the federated model compared to 4.27 meters of the centralized model for the test dataset. One should note that the reason behind this gap is increased convergence time, and the federated model reduces the gap of less than 1 meter after several rounds past 100 rounds.
Change in the MAE throughout rounds is shown for Scenario-2 is given in Fig. 5(a) to Fig. 5(c) with 5, 10, and 15 users. Uniformly distributed data causes early convergence and overfitting afterward, hence resulting in high training accuracy and lower test accuracy. For example, the federated model converges around 40 rounds with 15 users, and the gap between training and test accuracy increases afterward for both centralized and federated models.
The effect of location heterogeneity on MAE is presented in Fig.5(d) to Fig. 5(f). Note that, the heterogeneity affects both centralized and federated training, as well as increasing the convergence time. The ramifications of overfitting are less visible compared to the centralized
Fig. 5: Effect of number of users and various heterogeneity scenarios on accuracy and convergence time.
method since federated averaging handles heterogeneous models, which eventually increases variance in the global model, and reduces the bias. It can be seen that although training accuracy is lower than the uniform case, the test accuracy is higher due to the avoidance of overfitting. The MAE is 5.53 with 15 users. Heterogeneity in the number of training samples is another challenge for Federated Learning . It is shown that increasing heterogeneity increases convergence time and lowers accuracy. However, with enough users, the error eventually drops.
In this paper, we propose a federated learning-based technique to provide privacy-preserving crowdsourcing for localization. The proposed method assures the utilization of the private local data by keeping the data where it is generated by only sharing the local models. We demonstrate the prominence of federated learning for improving localization accuracy. We show that the localization accuracy can be improved up to 1.8 meters using federated learning on top of the centralized dataset. Federated learning could achieve below 5 meters MAE localization performance. We explored the effect of unbalanced sample-size and non-IID spatial heterogeneity of user data on the performance of federated learning.
In our experimental results, we can localize users within 4.98 meters accuracy in a 390 meters to 270 meters building utilizing RSS fingerprint federated learning-based localization. We are also able to show the performance of the federated learning close to the centralized training (i.e., within 1 meters for most cases). In general, we show that federated learning performs well for crowdsourcing of RSS fingerprint for localization in real-world settings while preserving the privacy of the users.
 C. Zhang, P. Patras, and H. Haddadi, “Deep learning in mobile and wireless networking: A survey,” IEEE Communications Surveys Tutorials, vol. 21, no. 3, pp. 2224–2287, thirdquarter 2019.
 F. Zafari, A. Gkelias, and K. K. Leung, “A survey of indoor localization systems and technologies,” IEEE Communications Surveys Tutorials, vol. 21, no. 3, pp. 2568–2599, thirdquarter 2019.
 B. Wang, Q. Chen, L. T. Yang, and H. Chao, “Indoor smartphone localization via fingerprint crowdsourcing: challenges and approaches,” IEEE Wireless Communications, vol. 23, no. 3, pp. 82–89, June 2016.
 J. Konen, H. B. McMahan, F. X. Yu, P. Richtrik, A. T. Suresh, and D. Bacon, “Federated learning: Strategies for improving communication efficiency,” 2016.
 S. Niknam, H. S. Dhillon, and J. H. Reed, “Federated learning for wireless communications: Motivation, opportunities and challenges,” arXiv preprint arXiv:1908.06847, 2019.
 H. B. McMahan, E. Moore, D. Ramage, S. Hampson, and B. A. y Arcas, “Communication-efficient learning of deep networks from decentralized data,” 2016.
 J. Park, S. Samarakoon, M. Bennis, and M. Debbah, “Wireless network intelligence at the edge,” arXiv preprint arXiv:1812.02858, 2018.
 J. Torres-Sospedra, R. Montoliu, A. Martnez-Us, J. P. Avariento, T. J. Arnau, M. Benedito-Bordonau, and J. Huerta, “UJIIndoorLoc: A new multi-building and multi-floor database for WLAN fingerprint-based indoor localization problems,” in 2014 Int. Conf. on Indoor Positioning and Indoor Navigation (IPIN), Oct 2014, pp. 261–270.
 M. Youssef and A. Agrawala, “The Horus WLAN location determination system,” in Proceedings of the 3rd international conference on Mobile systems, applications, and services. ACM, 2005, pp. 205–218.
 Y. Xie, Y. Wang, A. Nallanathan, and L. Wang, “An improved k-nearest-neighbor indoor localization method based on spearman distance,” IEEE Signal Processing Letters, vol. 23, no. 3, pp. 351–355, March 2016.
 X. Wang, L. Gao, S. Mao, and S. Pandey, “CSI-based fingerprinting for indoor localization: A deep learning approach,” IEEE Transactions on Vehicular Technology, vol. 66, no. 1, pp. 763–776, Jan 2017.
 M. Brunato and R. Battiti, “Statistical learning theory for location fingerprinting in wireless lans,” Computer Networks, vol. 47, no. 6, pp. 825–845, 2005.
 X. Wang, L. Gao, and S. Mao, “CSI phase fingerprinting for indoor localization with a deep learning approach,” IEEE Internet of Things Journal, vol. 3, no. 6, pp. 1113–1123, Dec 2016.
 X. Wang, X. Wang, and S. Mao, “Deep convolutional neural networks for indoor localization with CSI images,” IEEE Transactions on Network Science and Engineering, pp. 1–1, 2018.
 D. P. Kingma and J. Ba, “Adam: A method for stochastic optimization,” 2014.
 A. Ignatov, R. Timofte, W. Chou, K. Wang, M. Wu, T. Hartley, and L. Van Gool, “Ai benchmark: Running deep neural networks on android smartphones,” in Proceedings of the European Conference on Computer Vision (ECCV), 2018.