The aim of this paper is to detect fraud in telecommunications data which consists of millions of call records generated each day. The fraud detection is implemented via the construction of user call profiles using the calls detail records (CDR) data. This paper attempts to investigate the reliability of the unsupervised Random Forest method in building the profiles using its variable importance measure. Four different simulation scenarios, using different number of variable selection in each node of the tree, are performed.
3rd Annual ASEARC Research Conference. ASEARC: Proceedings of the Third Annual ASEARC Research Conference (Newcastle, N.S.W. 7-8 December, 2009)