Designing and testing scene enhancement algorithms for patients with retina degenerative disorders
© Al-Atabany et al; licensee BioMed Central Ltd. 2010
Received: 14 November 2009
Accepted: 18 June 2010
Published: 18 June 2010
Retina degenerative disorders represent the primary cause of blindness in UK and in the developed world. In particular, Age Related Macular Degeneration (AMD) and Retina Pigmentosa (RP) diseases are of interest to this study. We have therefore created new image processing algorithms for enhancing the visual scenes for them.
In this paper we present three novel image enhancement techniques aimed at enhancing the remaining visual information for patients suffering from retina dystrophies. Currently, the only effective way to test novel technology for visual enhancement is to undergo testing on large numbers of patients. To test our techniques, we have therefore built a retinal image processing model and compared the results to data from patient testing. In particular we focus on the ability of our image processing techniques to achieve improved face detection and enhanced edge perception.
Results from our model are compared to actual data obtained from testing the performance of these algorithms on 27 patients with an average visual acuity of 0.63 and an average contrast sensitivity of 1.22. Results show that Tinted Reduced Outlined Nature (TRON) and Edge Overlaying algorithms are most beneficial for dynamic scenes such as motion detection. Image Cartoonization was most beneficial for spatial feature detection such as face detection. Patient's stated that they would most like to see Cartoonized images for use in daily life.
Results obtained from our retinal model and from patients show that there is potential for these image processing techniques to improve visual function amongst the visually impaired community. In addition our methodology using face detection and efficiency of perceived edges in determining potential benefit derived from different image enhancement algorithms could also prove to be useful in quantitatively assessing algorithms in future studies.
There are thought to be 38 million people suffering from blindness worldwide , and this number is expected to double over the next 25 years. Additionally, there are 110 million people who have severely impaired vision. The low vision pathologies of this latter group can be divided mainly into two categories; those that predominantly suffer from a loss of visual acuity due to macular degenerations, and those that predominantly suffer from a reduction in the overall visual field such as Retinitis Pigmentosa. In many countries, there is an increasing prevalence of diabetic retinopathy and an ageing population with 1 in 3 over the age of 75 being affected with some form of AMD .
Despite advances in treatment such as antivascular endothelial growth factor agents for exudative age related macular degeneration (wet-AMD) and medical and surgical management of glaucoma, there are still a significant number of conditions which lead to severe sight loss. Dry AMD, and untreatable diabetic retinopathy, as well as inherited retinal degenerations such as Retinitis Pigmentosa (RP) are significant examples of these .
People with visual acuity impairment suffer from a range of problems affecting their mobility and quality of life .
Electronically enhanced visual aids have been proposed which offer a number of distinct advantages over conventional low vision aids in low vision rehabilitation . Prothero  overlaid virtual cues on the real scene, to improved the mobility of patients with Akinesia. Massof and Rickman  developed a low vision imaging system ("ELVIS") at Johns Hopkins University, which mainly provides magnification and contrast enhancement. Wolffsohn  overlaid edges on the original scenes, to enhance the television viewing. Although the results were good, using the Gaussian filter as a scene pre-smoothing before extracting the edges blurs the important features as well as the irrelevant textures.
The Harvard Vision Rehabilitation Lab group has published numerous papers in the field of vision rehabilitation. They multiplexed minified edges over the original scene on a see-through display . However, there is the potential for inattentional blindness, which is the inability of observers to maintain awareness of events in more than one of two superimposed scenes. Apfelbaum , tested the effect of vision-multiplexing in reducing the inattentional blindness phenomena, but he found that it does not have any positive or negative effect on reducing the inattentional blindness. Also, Fullerton et al  and Peli et al [12, 13] have tackled the problem of enhancing television images by overlaying extracted edges on the original images. However, patients reported some inconvenience due to the appearance of randomly highlighted pixels which was due to enhancing the noise as well as the major objects. Fernando , recently developed a portable aiding system by applying a digital zooming and edge enhancement to the scene, especially aimed at patients with RP.
Most of the work described above has been based on two main techniques; image resizing and edge overlay. However, both approaches rely on edge extraction techniques which can amplify irrelevant information such as noise or textural detail in addition to significant features. Everingham  tackled the irrelevant information enhancement problem by classifying the objects in the scene into eight main colored objects. Classification allowed separation of those objects from irrelevant details. The limitation is that the scene can only be separated on the basis of eight pre-defined objects with losing the ability to see the natural color information of the visual scene.
Image enhancement and segmentation algorithms have been progressively developed in the field of medical image processing [16–19]. However, scalability and implementability of these algorithms on portable and low power consumption devices is our main concern .
From the literature it is clear that there is a lack of objective assessment tools to quantitatively evaluate novel image enhancement methods, unless testing them on large numbers of patients. In this paper we describe three image enhancement techniques developed for patients with low vision due to retinal degeneration. To test these techniques, we have built a retinal image processing model of the degenerate retina to assess the degradation of the visual information. In our model, we aim to understand what information is transferred to the visual cortex rather than assess detailed low-level synaptic processing. Thus, our model aims to replicate the main centre-surround and color opponent spatial information processing tasks of the retina. We then reconstruct the image to assess loss of information and the impact of any visual defects. Using our mode, we create a virtual scotomata and asses its impact on the original and enhanced scenes. In order to form quantitative, we have used face detection as a key visual task. Using this it is possible to assess the effect of different image enhancements for different types and severity of retinal degeneration.
In this paper: we developed three image enhancement techniques which are Image Cartoonization, Edge Overlaying and Tinted Reduced Outlined Nature(TRON) algorithms, which we have tested on both patients in trials and using our model. Image Cartoonization has previously been described in the image processing community [21, 22]; here we describe its first use on patients with retinal degenerations. In the case of edge overlay, we have described an enhancement of this technique to improve segmentation of key features and removal of unnecessary ones. Finally, we present the TRON algorithm and its use in patients, which we believe will have advantages over edge only images as it maintains chromatic information.
A) Image enhancement algorithms
Human vision has its highest resolution with best visual acuity located at the fovea in the central macula much of the spatial processing of the visual cortex is designated to these regions. Patients with degeneration of the fovea and macula perceive an extreme blurred vision or a scotoma. In image processing terminology, there is a loss in the high frequency components of visual information. In addition, low contrast images can be particularly problematic. In order to improve this, our intention is to enhance the key features in the scene so as to enhance the effective contrast of the key features. As transferability to portable processing platforms is important, we have not attempted any form of saliency. Instead we use processing functions similar to those carried out by the retina and lower levels of the visual cortex which can be implemented on power efficient portable processing platforms [20, 23].
Simplification of the scene, using anisotropic filtering.
Extraction of the significant spatial derivatives, using a hierarchy method.
Boosting the original scene using the simplified spatial derivatives.
Image simplification is an important step before performing spatial derivatives (edge extraction) so as not to extract high frequency noise and textures . Gaussian filtering is a commonly used kernel for this purpose . While it is effective at noise removal, it removes high frequency information, thus blurring the edges of the significant object boundaries.
Median filtering can be used to remove speckle noise. It is applied uniformly across an image, smoothing all pixels which appear to be considerably different to their neighbours. Thus while, it is very effective in the elimination of speckle noise, it is often at the expense of a slight blurring of the scene .
Where n denotes the iteration number, Δt is the time step (it controls the accuracy and the speed of the smoothing) and ∇IH, ∇IV, ∇ID1, ∇ID2 represents the gradient in four directions.
As simple high frequency (small kernel) derivatives of this form can be lossy in their boundary detection, we use a multi-scale pyramidal approach to obtain lower frequency (large kernel) derivatives. This is the equivalent of using multiple higher order kernels, but is more efficient in processing terms .
Simplification of the image with anisotropic filtering
Calculating the spatial derivatives of the image
Quantization of the colors of the simplified image to create cartoon like images
Combining the quantized image with the negative of the gradient map
The Algorithm starts by smoothing the original image using the above anisotropic diffusion filter as described in equations (1) to (3), above. The anisotropic diffusion is applied to the color image by converting it to the YCbCr color space , after that the Y (the intensity channel) is diffused. Then the YCbCr image is converted back to the RGB format. The gradient image calculated as given in equations (6) above, and normalized between 0 and 1. We then define two threshold values, τ min , τ max and we set all pixels of the normalized gradient image below τ min to 0 and all the pixels above τ max are set to 1.
Q(x) is the quantized image, Δq is the bin width, q(x)nearest is the closest bin color to the current pixel f(x) and φq is a matrix used to control the sharpness transition between one bin to another. The full description of this method has been described previously by Winnemoller et al  and is presented in more detail in the Appendix.
Edge Overlaying Algorithms
The edge overlay algorithms use the same mathematics as those previously described. Here, we recolor and overlay gradient map onto either the original image, or a simplified version of the original image. Thus contrast should be improved compared to the original.
B) Degenerate retina model
There has been considerable previous work on modeling the human retina dating from Hubel and Wiesel . The majority of this literature focuses on physiological aspects of the retinal function . Other objectives include models for retinomorphic imaging systems which aim to mimic the human eye , and models for electronic retinal prostheses . The majority of these models have been focused on modeling the normal retina rather than determining how information is distorted in the case of retinal degeneration.
Our model focuses on the centre surround spatial processing function of the retina. We reconstruct the chromatic and achromatic spatial information pathways being sent to the visual cortex. We can then reconstruct these for assessment of the visual information content.
The human retina is composed of several layers, organized in a highly structured network that extracts and pre-processes visual information from the image projected upon it [35, 36]. Visual perception starts with image capture by the rod (achromatic, scotopic photoreceptors) and cones (chromatic - photopic photoreceptors). Our model ignores the difference between scotopic and photopic ranges as most imaging systems can only record with 8-bits of dynamic range and generally perform poorly at low light. We therefore separate the image matrix into achromatic (rod), blue (s-cone), green (m-cone), and red (L-cone) and yellow (for convenient opponent processing).
The centre surround spatial processing function in the retina results from the arrangement of the bipolar cells in particular and their connectivity to the retinal ganglion cells. The Horizontal cells perform smoothing and automatic gain control, the former can be performed through simple Gaussian filtering and the latter through histogram equalization. The amacrine cells are active in the achromatic and temporal processing, though we do not implement the latter as for this work we are investigating still images. The retina has two main visual pathways which transmit visual information to the visual cortex:
The parvocellular pathway (P), which is responsible for transmission of chromatic spatial features, and is the dominant pathway from the central vision.
The magnocellular pathway (M), which is responsible for achromatic and low-light spatio-temporal feature detection and is dominant in the peripheral vision.
A. Simulating foveal vision
To reduce the amount of information passing through the optical nerve to the visual cortex region, the visual system of primates has a space-variant nature where the resolution is high in the fovea and gradually decreases towards the periphery of the visual field. Effective vision is possible due to rapid scanning (saccades) of the eye across the vision scene. By this method it is possible to achieve very high resolution via the fovea, while maintaining a rapid wide field of vision. To simulate this sampling behaviour we use a multi-scale resolution sampling methods [37, 38], by dividing the image into two regions; fovea and periphery. The model has a 1:1 ratio of pixels in the fovea. The peripheral region is divided into concentric rings of equal width, which equivalent to one pixel. Each ring is blurred by a Gaussian function with kernel size growing exponentially with radial distance from the fovea. These variations with eccentricity represent the increasing size of retinal receptive fields with distance from the fovea.
Where r is the radial distance of the pixel (x, y) from the centre of the input image, and G σ (x, y) is a two dimensional Gaussian averaging filter with a standard deviation σ equal to log(r). Figure 4, shows the output of applying the eccentricity simulation on an input image.
B. Color separation
The next stage in the model after simulating the foveal-peripheral vision is to account for the color separation in the retina. Input images are 2D matrices, with RGB components. In contrast, the chromatic information in the human retina is encoded into two color opponent channels; green-red and blue-yellow, and one achromatic channel. We therefore convert to a LG R B Y color space .
The L channel represents absolute luminance and extends from 0 (black) to 100 (white). The other two channels G R and B Y represent the greenness-redness and the blueness-yellowness color opponents respectively. Negative values of G R indicate green while positive values indicate magenta; similarly, B Y negative values indicate blue and positive values indicate yellow. Pixels for which G R = B Y = 0 are achromatic and thus the L channel represents the achromatic scale of grays from black to white.
C. Horizontal layer
The horizontal cells serve as a negative feedback gain control on cone cells, adapting the reduction of glutamate release to increasing illumination. As the 8-bit dynamic range of most jpeg images is small, we consider the variation in illumination small, and thus we did not consider gain control in this model, although histogram equalization can be used to ensure optimal use of the 8-bit intensity range.
There are three types of the horizontal cells; HI (achromatic), HII and HIII (chromatic) cells. These cells have direct electrical synapses with each other and provide inhibitory feedback to the photoreceptors, with receptive field increasing towards the periphery. They are absent in the fovea. The horizontal cell function can be modeled with a diffusion process which results in a Gaussian-weighted spatial averaging of the cone inputs over the cell's RF.
Where L, R, G, B and Y represent the five separated channels: Luminance, Red, Green, Blue and Yellow.
D. Bipolar layer
Bipolar cells receive their inputs predominantly from the cones with some inhibitory feedback from the horizontal cells. ON bipolar cells depolarize with decreasing glutamate (increasing photo response) from the connecting photoreceptors, whereas OFF bipolar cells hyperpolarize. The synapses of surrounding ON and OFF bipolar cells to the retinal ganglion cells generate the centre-surround processing phenomena. In mammals, the ratio of the centre diameter field to the surround diameter one is range between 1:10 .
σ s and σ c are the surround and centre standard deviation of the Gaussian filter. The ratio between the surround sigma to the centre one is considered to be 1:2, which give a reasonable agreement with the physiologically measured value . Using this ratio value results in a receptive field diameter of the surround larger than the centre diameter by 5 to 6 times.
The size of the surround Gaussian kernel is set to 5 times larger than the size of centre kernel in each ON/OFF channel. Although there is no Yellow-centre/Blue-surround processing in the retina, we have included it here for purposes of processing symmetry.
E. Image reconstruction
Where ∇2 is the Laplacian operator and ∇·G is the divergence of the gradient G. There are different methods to solve the Poisson equation; such as finite difference methods, finite element methods and spectral methods. The fastest method is to solve it by using the fast Poisson solver method, which uses the fast Fourier transform to invert the Laplacian operator . Figure 4, shows the result of a reconstructed image by solving the Poisson equations for the three opponent channels; Luminance, RG and BY channels.
F. Macular degeneration simulation
The model described above simulates a normal retina. Degeneration can be implemented by turning off photoreceptors after the foveal simulation (eccentricity simulation) but before the color separation. We can thus create scotomata similar to that found in AMD patients.
Figure 4, shows the degenerate retina model after adding the macular degeneration part and the output image of each block is shown beside each stage.
C) Patient clinical trial protocol
27 patients were tested at the Oxford Eye hospital, John Radcliffe Hospital UK with approval from the Oxfordshire ethics committee. Of the patients; 9 were diagnosed with Retinitis Pigmentosa (RP), including 1 individual with Pseudoxanthoma Elasticum and 1 individual with Leber's Hereditary Optic Neuropathy. The remaining had macular pathologies, predominantly Stargardt's disease. The average visual acuity (VA) in the better eye in this cohort was 0.63 ± 0.07 (Range: -0.26: 1.14) and the average contrast sensitivity (CS) in the better eye was 1.22 ± 0.08 (Range: 0.15:1.65). The heterogeneity of the patient conditions was to allow us to broadly determine the effect of different severities and types of retinal degeneration on our enhancement methods.
Patients were presented with 25 sets of images and 4 sets of videos sequences. Images enhanced with our algorithms were randomly interspersed to even out the effect of memory. For each image, patients were asked to identify key scene features and were asked to rank different version of each image for both ease of major feature identification, and willingness to perceive images in this way in everyday life. In the case of the video sequences, these were placed next to each other and the patients asked to give viewing preference. The images and videos were projected to the patients using the Panasonic PT-AX200E projector with resolution of 1280 × 720 and maximum projection brightness (At a distance of 2 m from the projection wall) of 2000 Lumens, in a darkened room. The distance between the patient and the wall was kept to 1.5 m and the dimension of the projected screen was 110 cm width and 79 cm height so that the field of view was maintained at 40°.
Results and Discussion
The results in this paper are divided into two sections; results obtained from testing the four image enhancement algorithms in our model, and results obtained from testing these algorithms on patients with retinal degeneration.
The inputs into the model consist of original (unprocessed), and modified scenes and video sequences. The model was varied for various degrees of severity of retinal degeneration.
1) Model simulation testing on still images
The gradient image was calculated as given in equations (6) above for each image.
Scaling the gradient image intensity between 0 and 255.
- 3)Summing up the intensities over the whole gradient image and dividing the result on the total number of pixels, according to this equation:(20)
Where G is the gradient image with dimension of M × N
From figure 7, it is highly significant that the TRON algorithm shows the highest performance in detecting and perceiving edges (P < 0.0176) followed by the Cartoonization (P < 0.0322) and edge overlaying (P < 0.0578).
In order to determine more real-world effects of our algorithms, we examined the effect of our image enhancements on faced detection. This is an important function for daily interaction and is one of the key deficits affecting low vision patients. In computer vision field, face recognition algorithms initially perform the detection of a face, followed by identification of its distinguishing characteristics to perform identification . Separate detection and recognition stages in the process of face perception in the human visual system has also been described in the neuropsychology literature . For our purposes, as recognition involves much higher levels of cortical function, we use face detection as a primary test of the image enhancement algorithms.
Detecting faces requires the extraction of features that are common to all faces. In this paper we use two separate methods to test the efficiency of our algorithms in detecting faces; the first method is the Viola-Jones method , This is a featurebased algorithm, which attempts to detect the presence of certain facial features. It uses a cascade of increasingly complex filters, or feature detectors to improve performance to give a robust but quick detection. The first filter in the cascade consists of only two simple features, each composed of a few rectangular light and dark regions. Subsequent stages of filtering are performed only on regions scoring positive at any preceding stage. The Viola-Jones algorithm uses filter templates similar to the centre-surround phenomena in the human visual perception and is therefore additionally beneficial to this work. The second method is the Kienzle  appearance-based algorithm, which uses machine learning techniques to find relevant characteristics of face and non-face images. Then it builds discriminant function (i.e., decision surface, separating hyperplane, threshold function) to discriminate between these relevant characteristics of the faces and non-faces classes. Kienzle used the Support Vector Machines SVM classifier as the discrimination or the decision surface between the faces and non-faces classes . SVM is a method to train polynomial function, neural networks, or radial basis function (RBF) classifiers. A full description of this method can be found in Kienzle paper.
We selected 14 images with different sizes of faces to give a sum total of 166 faces. Firstly, the original and processed images were degenerated by using our retina simulation model and then the output from the model fed to the Viola-Jones and Kienzle face detection algorithms. Each image underwent different levels of degeneration, starting from no degeneration to 4 times the biological macula size in increasing steps of 0.4, so in total we had 11 levels of degeneration for each image. Figure 11, shows the detected face rounded by squares in the original and cartoonized image with three levels of degeneration; no degeneration, medium degeneration and severe degeneration. Results show higher contrast around the faces of the cartoonized images.
2) Testing dynamic scenes
1) Results from the still images
Figure 16(b), shows the patients' preference to the enhanced images relative to the original version, for the two groups of patients. Group 1 who preferred 70% of the images in the processed format and Group 2 who preferred the processed images less than 70%. For Group 1, we found that image Cartoonization was the most preferable for those patients, especially for images with low contrast, luminance and feature size. This was expected, as Cartoonization increases the contrast between the foreground objects and background. Furthermore, the added negative edges in the Cartoonization process added more contrast enhancement to the relevant features. Alternatively, edge overlay was preferred for scenes with high luminance and large-major features. One possible explanation is that high luminance can cause glaring and in that case, the differentiation between scenes objects will be difficult. Hence, by making a separation between foreground objects and background with different color can be more convenient for these patients. These results conclude that, Cartoonization and edge overlaying are the best for feature detection and recognition. TRON was least chosen, because it suppresses most of the natural and color information on the scene.
2) Results from video
From these observations we can conclude that presently there is no single algorithm which can be used for all the patients in all the circumstances. However, if implemented on a wearable augmented vision headset, patients could select the appropriate algorithm given the personal preference and visual situation. For example, TRON may be most appropriate for navigation. Alternatively Cartoonization and edge overlay may be most appropriate for watching television or more static scenes.
In this article we have described three different image enhancement algorithms developed for patients suffering from retinal degenerative diseases. Additionally, we have presented an image processing model for retinal degeneration which we have used to evaluate the efficiency of these algorithms. The image processing model allows us to reconstruct the information stream towards the visual cortex and assess our algorithms using objective tests such as face detection. Results from this model show that TRON and edge overlaying algorithms are very useful in detecting spatial features in dynamic scenes and perceiving the edges of simple objects in static scenes. Image Cartoonization improves face detection. The same enhancement algorithms have also been tested on group of patients with primarily macular degenerations. When we analyzed the results from these patients we found that patients with CS range from 0.45 to 1.2 and VA greater than 0.9 derived the highest benefit from using these algorithms. This is highly consistent with the data from our model. Furthermore, the patient preference for the Cartoonization algorithm in static scenes and the TRON algorithm for dynamic scenes is also consistent with the findings from our model. In addition, to the identification of potential benefit of these two algorithms to the visually impaired, our testing methodology itself would be expected to be very useful in this field, as the ability to objectively determine efficacy of enhancement algorithms for those with low CS and VA will be beneficial for future studies. In the long run, we believe image enhancement algorithms such as that we present could perform the basis of the front end processing interface for retinal prosthesis  or new forms of visual assistive devices.
Appendix: Short description of the Cartoonization algorithm
N is the number of bins which fixed in this paper to 8 bins. The numerator in equation (23) is set to 100 because the maximum value of the luminance channel is 100. f(x) is the pixel value of the luminance channel. INT means that the luminance channel will be rounded to the nearest integer value.
In equation (8), if φ q is fixed, then the transition sharpness is independent of the underlying image, creating many noticeable transitions in large smooth-shaded regions. To minimize these transitions, φq is defined to be a function of gradient image. We allow hard bin boundaries only where the gradient is high. In low gradient regions, bin boundaries are spread out over a larger area.
According to Winnemoller et al , the sharpness range is set between [Λφ Ωφ] and the gradient range to [τ min τ max ]. The calculated gradient is clamped to [τ min τ max ] and then φ q is generated by linearly mapping the clamped gradient map to [∇φ Ωφ].
We found that setting τ min = 0.1 and τ max = 0.4 of the normalized gradient image and the sharpness range to ∇φ = 3 and Ωφ = 25 give better edge enhancement.
We would like to acknowledge and thank the Royal Society Research fund, and the EPSRC (F029241) for supporting this research. Also Mr. Walid Al-Atabany would like to thank the Egyptian government, who are sponsoring him for his PhD. We would also like to thank Anna Rudenko, the research nurse and Bushra Mushtaq, the medical retinal Fellow for helping with the patients in this study and the patients for taking part in the study.
- Foster A, Resnikoff S: The impact of Vision 2020 on global blindness. Eye 2005, 19: 1133–1135. 10.1038/sj.eye.6701973View ArticleGoogle Scholar
- Tate R, Smeeth L, Evans J, Fletcher A, Owen C, Wolfson AR: "The prevalence of visual impairment in the UK". Report commissioned by the Royal National Institute of the Blind 2005. http://www.rnib.org.uk Google Scholar
- Veraart C, Duret F, Brelén M, Oozeer M, Delbeke J: Vision rehabilitation in the case of blindness. Expert Rev Med Devices 2004, 1: 139–153. 10.1586/174344188.8.131.52View ArticleGoogle Scholar
- Elliott DB, Trukolo-Ilic M, Strong JG, Pace R, Plotkin A, Bevers P: Demographic characteristics of the vision-disabled elderly. Invest Ophthalmol Vis Sci 1997, 38: 2566–2575.Google Scholar
- Brabyn JA: New Developments in Mobility and Orientation Aids for the Blind. Ieee Transactions on Biomedical Engineering 1982, 29: 285–289. 10.1109/TBME.1982.324945View ArticleGoogle Scholar
- Prothero JD: The treatment of akinesia using virtual images. Industrial Engineering, University of Washington; 1993.Google Scholar
- Massof RW, Rickman DL: Obstacles encountered in the development of the low vision enhancement system. Optom Vis Sci 1992, 69: 32–41. 10.1097/00006324-199201000-00005View ArticleGoogle Scholar
- Wolffsohn JS, Mukhopadhyay D, Rubinstein M: Image enhancement of real-time television to benefit the visually impaired. Am J Ophthalmol 2007, 144: 436–440. 10.1016/j.ajo.2007.05.031View ArticleGoogle Scholar
- Peli E, Luo G, Bowers A, Rensing N: 22.4: Invited Paper: Augmented Vision Head-Mounted Systems for Vision Impairments. SID Symposium Digest of Technical Papers 2007, 38: 1074–1077. 10.1889/1.2785492View ArticleGoogle Scholar
- Apfelbaum HL, Apfelbaum DH, Woods RL, Peli E: Inattentional blindness and augmented-vision displays: effects of cartoon-like filtering and attended scene. Ophthalmic Physiol Opt 2008, 28: 204–217. 10.1111/j.1475-1313.2008.00537.xView ArticleGoogle Scholar
- Fullerton M, Peli E: Post Transmission Digital Video Enhancement for People with Visual Impairments. JSoc Inf Disp 2006, 14: 15–24. 10.1889/1.2166829View ArticleGoogle Scholar
- Peli E: Recognition performance and perceived quality of video enhanced for the visually impaired. Ophthalmic Physiol Opt 2005, 25: 543–555. 10.1111/j.1475-1313.2005.00340.xView ArticleGoogle Scholar
- Peli E, Kim J, Yitzhaky Y, Goldstein RB, Woods RL: Wideband enhancement of television images for people with visual impairments. Journal of the Optical Society of America a-Optics Image Science and Vision 2004, 21: 937–950. 10.1364/JOSAA.21.000937View ArticleGoogle Scholar
- Vargas-Martín F, Peláez-Coca MD, Ros E, Diaz J, Mota S: A generic real-time video processing unit for low vision. International Congress Series 2005, 1282: 1075–1079. 10.1016/j.ics.2005.05.107View ArticleGoogle Scholar
- Everingham MR, Thomas BT, Troscianko T: Head-mounted Mobility Aid for Low Vision using Scene Classification Techniques. The International Journal of Virtual Reality 1999, 3: 3–12.Google Scholar
- Comaniciu D, Meer P: Mean shift: A robust approach toward feature space analysis. Ieee Transactions on Pattern Analysis and Machine Intelligence 2002, 24: 603–619. 10.1109/34.1000236View ArticleGoogle Scholar
- Han SD, Tao WB, Wang DS, Tai XC, Wu XL: Image Segmentation Based on GrabCut Framework Integrating Multiscale Nonlinear Structure Tensor. Ieee Transactions on Image Processing 2009, 18: 2289–2302. 10.1109/TIP.2009.2025560MathSciNetView ArticleGoogle Scholar
- Teng QZ, He XH, Jiang L, Deng ZY, Wu XQ, Tao DY: Color image segmentation algorithm based on neural networks. Biomedical Photonics and Optoelectronic Imaging 2000, 4224: 109–113.View ArticleGoogle Scholar
- Grau V, Mewes AUJ, Alcaniz M, Kikinis R, Warfield SK: Improved watershed transform for medical image segmentation using prior information. Ieee Transactions on Medical Imaging 2004, 23: 447–458. 10.1109/TMI.2004.824224View ArticleGoogle Scholar
- Atabany W, Degenaar P: Parallelism to reduce power consumption on FPGA spatiotemporal image processing. Proc IEEE International Symposium on Circuits and Systems ISCAS 2008 2008, 1476–1479.View ArticleGoogle Scholar
- Raskar R, Tan K-H, Feris R, Yu J, Turk M: Non-photorealistic camera: depth edge detection and stylized rendering using multi-flash imaging. ACM Transactions on Graphics 2004, 23: 679–688. 10.1145/1015706.1015779View ArticleGoogle Scholar
- DeCarlo D, Santella A: Stylization and abstraction of photographs. In SIGGRAPH '02: Proceedings of the 29th annual conference on Computer graphics and interactive techniques. New York, USA: ACM; 2002:769–776. full_textView ArticleGoogle Scholar
- Banks DJ, Degenaar P, Toutnazou C: Low-power pulse-width-modulated neuromorphic spiking circuit allowing signed double byte data transfer along a single channel. Electronics Letters 2007, 43: 704–706. 10.1049/el:20071440View ArticleGoogle Scholar
- Atabany W, Degenaar P: A Robust Edge Enhancement Approach for Low Vision Patients Using Scene Simplification. Cairo International Biomedical Engineering Conference CIBEC 2008 2008, 1–4. full_textView ArticleGoogle Scholar
- Bovik AC: Handbook of Image and Video Processing. Orlando, FL, USA: Academic Press, Inc; 2005.Google Scholar
- Perona P, Malik J: Scale-Space and Edge-Detection Using Anisotropic Diffusion. Ieee Transactions on Pattern Analysis and Machine Intelligence 1990, 12: 629–639. 10.1109/34.56205View ArticleGoogle Scholar
- Fleck MM: Some Defects in Finite-Difference Edge Finders. Ieee Transactions on Pattern Analysis and Machine Intelligence 1992, 14: 337–345. 10.1109/34.120328View ArticleGoogle Scholar
- Canny J: A Computational Approach to Edge-Detection. Ieee Transactions on Pattern Analysis and Machine Intelligence 1986, 8: 679–698. 10.1109/TPAMI.1986.4767851View ArticleGoogle Scholar
- Jobson DJ, Rahman ZU, Woodell GA: A multiscale retinex for bridging the gap between color images and the human observation of scenes. Ieee Transactions on Image Processing 1997, 6: 965–976. 10.1109/83.597272View ArticleGoogle Scholar
- Holger W, Olsen SC, Gooch B: Real-time video abstraction. ACM Trans Graph 2006, 25: 1221–1226. 10.1145/1141911.1142018View ArticleGoogle Scholar
- Hubel DH, Wiesel TN: Republication of The Journal of Physiology (1959) 148, 574–591: Receptive fields of single neurones in the cat's striate cortex. Journal of Physiology-London 2009, 587: 2722–2732. 10.1113/jphysiol.2009.174185View ArticleGoogle Scholar
- Chang PR, Yeh BF: A Retina-Like Image Acquisition-System with Wide-Range Light Adaptation. Visual Communications and Image Processing 91: Image Processing, Pts 1 and 2 1991, 1606: 456–469.Google Scholar
- Mahowald MA: Silicon retina with adaptive photoreceptors. Volume 1473. Edited by: Mathur BP, Koch C. SPIE; 1991:52–58.Google Scholar
- Morillas CA, Romero SF, Martínez A, Pelayo FJ, Ros E, Fernández E: A design framework to model retinas. Biosystems 2007, 87: 156–163. 10.1016/j.biosystems.2006.09.009View ArticleGoogle Scholar
- Kolb H, Fernández E, Nelson R: Web Vision. 2010. [http://webvision.med.utah.edu/]Google Scholar
- Field GD, Chichilnisky EJ: Information Processing in the Primate Retina: Circuitry and Coding. Annual Review of Neuroscience 2007, 30: 1–30. 10.1146/annurev.neuro.30.051606.094252View ArticleGoogle Scholar
- Geisler WS, Perry JS: A real-time foveated multiresolution system for low-bandwidth video communication. in Proc SPIE 1998, 294–305. full_textGoogle Scholar
- Chang E-C, Mallat S, Yap C: Wavelet Foveation. Applied and Computational Harmonic Analysis 2000, 9: 312–335. 10.1006/acha.2000.0324MathSciNetView ArticleGoogle Scholar
- Dowling JE: The Retina: An Approachable Part of the Brain. Harvard University Press; 1990.Google Scholar
- Shah S, Levine MD: Visual information processing in primate cone pathways. I. A model. IEEE Trans Syst Man Cybern B Cybern 1996, 26: 259–274. 10.1109/3477.485837View ArticleGoogle Scholar
- Wang H, Chen Y, Fang T, Tyan J, Ahuja N: Gradient Adaptive Image Restoration and Enhancement. Proc IEEE International Conference on Image Processing 2006, 2893–2896.Google Scholar
- Shen J, Jin X, Zhou C, Wang CCL: Gradient based image completion by solving the Poisson equation. Comput Graph 2007, 31: 119–126. 10.1016/j.cag.2006.10.004View ArticleGoogle Scholar
- Pelli DG, Robson JG, Wilkins AJ: The Design of a New Letter Chart for Measuring Contrast Sensitivity. Clinical Vision Sciences 1988, 2: 187–199.Google Scholar
- Tsao DY, Livingstone MS: Mechanisms of Face Perception. Annual Review of Neuroscience 2008, 31: 411–437. 10.1146/annurev.neuro.30.051606.094238View ArticleGoogle Scholar
- Moscovitch M, Winocur G, Behrmann M: What Is Special about Face Recognition?: Nineteen Experiments on a Person with Visual Object Agnosia and Dyslexia but Normal Face Recognition. J Cogn Neurosci 1997, 9: 555–604. 10.1162/jocn.19184.108.40.2065View ArticleGoogle Scholar
- Viola P, Jones M: Robust Real-time Object Detection. International Journal of Computer Vision 2001.Google Scholar
- Kienzle W, Bakir GH, Franz MO, Scholkopf B: Face Detection --- Efficient and Rank Deficient. In Advances in Neural Information Processing Systems 17. Edited by: Saul LK, Weiss Y, Bottou Leo. Cambridge, MA: MIT Press; 2005:673–680.Google Scholar
- Osuna E, Freund R, Girosi F: Training Support Vector Machines: an Application to Face Detection. Computer Vision and Pattern Recognition, IEEE Computer Society Conference on 1997, 0: 130.View ArticleGoogle Scholar
- Degenaar P, Grossman N, Memon MA, Burrone J, Dawson M, Drakakis E, Neil M, Nikolic K: Optobionic vision-a new genetically enhanced light on retinal prosthesis. Journal of Neural Engineering 2009, 6: 035007. 10.1088/1741-2560/6/3/035007View ArticleGoogle Scholar
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.