Acquiring Original Data from Shredded Documents Using Neural Network

Show simple item record

dc.contributor.author Liyanage, C.R.
dc.contributor.author Jayasinghe, P.K.S.C.
dc.date.accessioned 2024-07-19T05:21:37Z
dc.date.available 2024-07-19T05:21:37Z
dc.date.issued 2018-03-07
dc.identifier.citation Liyanage, C. R. & Jayasinghe, P. K. S. C. (2018). Acquiring Original Data from Shredded Documents Using Neural Network. 15th Academic Sessions, University of Ruhuna, Matara, Sri Lanka, 94.
dc.identifier.issn 2362-0412
dc.identifier.uri http://ir.lib.ruh.ac.lk/handle/iruor/17075
dc.description.abstract Shredding is a common way to eliminate documents containing sensitive and confidential data for legal and ethical purposes in different areas such as military applications, scientific inventions, businesses and government organizational applications. Hence acquiring original data from shredded documents is an important part of investigative sciences. Performing recovering process of the shredded documents manually can result in frequent human errors. Therefore, developing a computer-based recovery algorithm is of vital importance in obtaining original data efficiently and accurately. This study investigated an approach of using Neural Network (NN) in reconstructing shredded documents, which takes a scanned image consisting of document strips and outputs the fully or partially recovered original document as another image. The model was developed on single-sided, black and white typed text documents having maximum up to 20 vertical strips from each page. The process consisted of two main steps, namely pre-processing and reconstruction. In pre-processing, paper strips were scanned into an image, and then several image processing techniques were performed to gain shreds into different images without backgrounds. In Reconstruction, a correlation vector was obtained among each shred pairs by comparing the average neighbourhood intensities for each pixel along vertical edges of shreds. This correlation vector was prepared according to the data format suitable for NN. The proposed feed forward back propagation NN model was trained and tested to automatically find the best matching adjacent shred by feeding the correlation vectors as the inputs. This model increases the ability of addressing the vagueness of the algorithm. The algorithm was validated by reconstructing a considerable number of single documents and was proved to be effective with an average resembling frequency of ~78%. The main advantages of the proposed method over the existing approaches are the simplicity, fastness, less human intervention and increased predictive and intelligent characteristics. en_US
dc.language.iso en en_US
dc.publisher University of Ruhuna, Matara, Sri Lanka. en_US
dc.subject Algorithm en_US
dc.subject Original data en_US
dc.subject Neural Network en_US
dc.subject Shredding en_US
dc.title Acquiring Original Data from Shredded Documents Using Neural Network en_US
dc.type Article en_US


Files in this item

This item appears in the following Collection(s)

Show simple item record

Search DSpace


Browse

My Account