Computer Optics

Number 47/1, 2023

5-15
Sharp focusing of on-axis superposition of a high-order cylindrical vector beam and a beam with linear polarization

Authors: V.V. Kotlyar, S.S. Stafeev, V.D. Zaitsev

Abstract

View in PDF

Number of views: 24

In this work, the sharp focusing of a laser beam whose initial polarization pattern is formed by superposition of a cylindrical mth-order vector beam and a homogeneous linearly polarized beam is considered theoretically and numerically. Although in the source plane of such a beam both the angular spin momentum and the third Stokes parameter are equal to zero, we reveal that given odd m, subwavelength local regions are formed in the focal plane, where transverse vortex energy flows occur and the third Stokes parameter (the on-axis component of the angular spin momentum) is non-zero. Thus, at odd m, at the focus of such a beam there are – sub-regions with elliptical polarization of light with alternating handedness in the adjacent sub-regions (clockwise and counterclockwise). This phenomenon can be interpreted as a variant of an optical Hall effect. We note that at even m, the field at the focus is linearly polarized at every point and no transverse energy flow is observed.

16-26
Modeling of spontaneous emission in presence of cylindrical nanoobjects: the scattering matrix approach

Authors: V.V. Nikolaev, E.I. Girshova, M.A. Kaliteevski

Abstract

View in PDF

Number of views: 20

We propose a method of analysis of spontaneous emission of a quantum emitter (an atom, a luminescence center, a quantum dot) inside or in vicinity of a cylinder. At the focus of our method are analytical expressions for the scattering matrix of the cylindrical nanoobject. We propose the approach to electromagnetic field quantization based of eigenvalues and eigenvectors of the scattering matrix. The method is applicable for calculation and analysis of spontaneous emission rates and angular dependences of radiation for a set of different systems: semiconductor nanowires with quantum dots, plasmonic nanowires, cylindrical hollows in dielectrics and metals. Relative simplicity of the method allows obtaining analytical and semi-analytical expressions for both cases of radiation into external medium and into guided modes.

27-35
Spatial and time characteristics of a four-wave radiation converter in a parabolic waveguide with resonant nonlinearity

Authors: E.V. Vorobeva, V.V. Ivakhnik, D.R. Kapizov

Abstract

View in PDF

Number of views: 20

Spatial and temporal characteristics of a degenerate four-wave converter in a multimode waveguide with resonant nonlinearity in a scheme with counter-pumping waves are analyzed using the time response function and the point spread function. For single-mode pump waves with equal mode numbers, the dependences of the time response width on the waveguide length, the intensity of the first pump waves, and the mode number in the mode expansion of the object wave amplitude are obtained for the four-wave converter. The greatest contribution to the object wave amplitude is shown to be from the waveguide mode whose number coincides with the mode number of single-mode pump waves. For the stationary model, taking into account the spatial structure of the Gaussian pump wave leads to a monotonous decrease with a decrease in the pump beam width, followed by a constant value of the PSF module width. With single-mode pump waves with equal mode numbers, An increase in the mode number of the pump waves leads to a redistribution of energy concentrated in the side maxima of the point signal image and improvement in the quality of the wavefront reversal for a model with single-mode pump waves with equal mode numbers.

36-39
Reverse energy flow in vector modes of optical fibers

Authors: S.S. Stafeev, A.D. Pryamikov, G.K. Alagashev, V.V. Kotlyar

Abstract

View in PDF

Number of views: 21

In this paper, the propagation of a second-order cylindrical vector beam in gradient-index and microstructured fibers is numerically simulated using the RSoft Fullwave software. The second-order vector beams are shown to be vector modes of these fibers. In the calculated fundamental modes, regions are found in which there is an energy flow directed oppositely to the beam propagation direction (regions of a reverse energy flow). The absolute value of the longitudinal component of the reverse energy flow is found to be much lower than that of the forward flow.

40-47
Design of optical elements for an extended light source

Authors: E.V. Byzov, L.L. Doskolovich, S.V. Kravchenko, M.A. Moiseev, N.L. Kazanskiy

Abstract

View in PDF

Number of views: 25

Using the previously developed optimization method for an extended light source [Byzov EV, Kravchenko SV, Moiseev MA, Bezus EA, Doskolovich LL. Optimization method for designing double-surface refractive optical elements for an extended light source. Opt Express 2020; 28(17): 24431-24443. DOI: 10.1364/OE.400609], we designed a compact refractive optical element (the ratio of the element height to the light source size being 1.55) providing a uniform illuminance distribution in a shifted rectangular region. An application of the optimization method for calculating the so-called TIR-elements, exploiting the phenomenon of the total internal reflection of rays, is considered. For an extended light source, compact TIR-elements with freeform exit surfaces that generate uniform illuminance distributions in a rectangular region are designed. The results of the work show promise for a wide class of problems of designing compact optical elements for light-emitting diodes.

48-52
RGB color camera for dynamical measurements of high temperature distribution on a surface of the heated solid

Authors: K.M. Bulatov, P.V. Zinin, A.A. Bykov, I.V. Malykhina

Abstract

View in PDF

Number of views: 21

In this report we describe a fast 3-color method of the measurement of temperature distributions on a surface of a heated solid using a RGB color camera with a high frame rate (100 images per second). Statistical error the RGB method is not high, and do not exceed around 5.5% which is surprising taking in to account the number of the measurements at each pixel. Comparison of the results of the temperature measurements on a tungsten plate heated by infra-red laser radiation and conducted with this technique and those obtained with the acousto-optical tunable filter technique demonstrate that error of the temperature measured by 3-color method is only two times as high as that of the tandem acousto-optic filter technique method.

53-61
Computational and experimental studies on SnO2 thin films at various temperatures

Authors: K. Gurushankar, M. Grishina, M. Gohulkumar, K. Kannan

Abstract

View in PDF

Number of views: 22

Tin oxide (SnO2) thin films was prepared by dip-coating technique at various bath temperatures (313, 333, 353 and 373 K) and annealed at 673 K in this study. And the obtained results were studied and correlated with the computational method. Scanning electron microscopy (SEM) investigation demonstrated that the prepared samples are spherical with agglomeration. The elemental analysis (EDAX) confirms the presence of Sn and O. Further, the SnO2 thin films microstructures are simulated, their thermodynamic and surface properties have been calculated. Micro-Raman spectra were recorded for the prepared samples. Micro-Raman results exhibit the first-order Raman mode E1gsub> (475 cm−1) indicating that the grown SnO2 belongs to the rutile structure. In addition, the envelope method used for studying optical characteristics of the thin films from the transmittance spectra. The semiconducting nature of the films has been noticed from linear I-V characteristics. Furthermore, the electrical conductivity studies suggest that the highest conductivity samples acquire the lowest activation energy and their values are also in the semiconducting range.

62-67
Optimization, fabrication and characterization of a binary subwavelength cylindrical terahertz lens

Authors: S.I. Kharitonov, V.S. Pavelyev, N.L. Kazanskiy, Y.S. Strelkov, K.N. Tukmakov, A.S. Reshetnikov, S.V. Ganchevskaya, V.V. Gerasimov, B.A. Knyazev

Abstract

View in PDF

Number of views: 17

A problem of optimizing the subwavelength microrelief of a binary cylindrical transmissive diffractive lens (DL) with a 300-mm focal length for a wavelength of λ=141 μm was considered. High-resistivity silicon was chosen as the DL substrate material. The angle of incidence of the illuminating beam was taken to be π/6. The optimization parameters were the height of the DL profile and the fill factor of the groove. The main goal of optimizing the design was to increase the diffraction efficiency of the lens. The DL diffraction efficiency was calculated using a Fourier mod method. The DL was fabricated by plasma-chemical etching (Bosch process) of the surface of a silicon substrate. The diffraction efficiency of the calculated lens was estimated to be 70%. However, a full-scale experiment showed the real efficiency to be much lower. These differences are related to both errors in the manufacturing process of the DL and non-ideal thickness parameters of the silicon wafers.

68-78
Development of digital image processing algorithms based on the Winograd method in general form and analysis of their computational complexity

Authors: P.A. Lyakhov, N.N. Nagornov, N.F. Semyonova, A.S. Abdulsalyamova

Abstract

View in PDF

Number of views: 22

The fast increase of the amount of quantitative and qualitative characteristics of digital visual data calls for the improvement of the performance of modern image processing devices. This article proposes new algorithms for 2D digital image processing based on the Winograd method in a general form. An analysis of the obtained results showed that the use of the Winograd method reduces the computational complexity of image processing by up to 84% compared to the traditional direct digital filtering method depending on the filter parameters and image fragments, while not affecting the quality of image processing. The resulting Winograd method transformation matrices and the algorithms developed can be used in image processing systems to improve the performance of the modern microelectronic devices that carry out image denoising, compression, and pattern recognition. Research directions that show promise for further research include hardware implementation on a field-programmable gate array and application-specific integrated circuit, development of algorithms for digital image processing based on the Winograd method in a general form for a 1D wavelet filter bank and for stride convolution used in convolutional neural networks.

79-91
Correction of rotational blur in images of stars observed by an astroinertial attitude sensor against the background of the daytime sky

Authors: N.N. Vasilyuk

Abstract

View in PDF

Number of views: 22

A rotational blur correction algorithm is considered as the initial stage of image processing in the problem of attitude measurement using a star tracker. To implement this algorithm, the star tracker must be equipped with a three-axis gyroscope. The algorithm does not guarantee the detection of an image of a star against the background of the daytime sky in one frame but facilitates conditions for subsequent image stacking. The correction aims to localize energy maxima of the blurred star images in pixels with predetermined characteristics. The correction highlights these pixels against the background and improves the signal-to-noise ratio, though deteriorating the artistic quality of the whole digital image. The key characteristic of the pixel of maximum localization is that it is where the geometric image of the star is found at the start of the exposure of the frame under correction. The correction is performed in the form of frame processing with a digital finite-impulse-response (FIR) filter. The impulse response of the filter is inhomogeneous and represents a core of rotational blur, synthesized in each pixel of the corrected frame. Algorithms for calculating levels of the signal, background, and noise in the image of a star observed against the background of the daytime sky with a rotating camera are described. Dependences of the signal-to-noise ratios in various pixels of a blurred image on the exposure time and on the angular velocity of the camera rotation are analyzed. The signal-to-noise ratios in the star image before and after the blur correction are calculated. The simulation results are illustrated by the example of an image of a bright star, clearly showing specific features of the proposed rotational blur correction algorithm.

92-101
Color consistency method for cameras with unknown model

Authors: S. Bibikov, M. Petrov, A. Alekseyev, M. Aliyev, R. Paringer, Ye. Goshin, P. Serafimovich, A. Nikonorov

Abstract

View in PDF

Number of views: 23

Modern methods of computational photography make it possible to bring the quality of images obtained by mobile cameras closer to the quality of professional cameras. One of the most important tasks is that of ensuring the consistency of colors from different cameras. In this paper, we propose a simple and efficient way to bring the colors of one camera to another, based on the approximation of the required transformation by a tone correction spline and a color transformation matrix. An experimental study was carried out in a rather complicated case, in which it was required to match colors of the images obtained from two fundamentally different sensors, as well as using diffractive optics. The results of the experiments showed that the proposed method allows one to obtain a higher accuracy of color matching between cameras than existing analogues.

102-111
Iterative algorithm for accurate superposition of contours with non-uniform sampling step

Authors: R.R. Diyazitdinov

Abstract

View in PDF

Number of views: 23

In this article, we describe an iterative algorithm for accurate superposition of contours with non-uniform sampling step. The processing contours are characterized by the same shape, but the sampling step is non-uniform, with no matching between points of the superposed contours. This makes impossible the use of methods for estimating superposition parameters by matching points. The algorithm proposed herein allows estimating the offsets and rotation angle separately. The idea of the algorithm is to perform the iterative correction of parameters. An estimate of the offsets is used to estimate the rotation angle and, vice versa, an estimate of the rotation angle is used to estimate the offsets. The proposed algorithm is characterized by a higher speed of processing than a brute force algorithm and a lower estimation error than algorithms that analyze contour macroparameters.

112-117
Detection of surface defects in welded joints during visual inspections using machine vision methods

Authors: M.G. Yemelyanova, S.S. Smailova, O.E. Baklanova

Abstract

View in PDF

Number of views: 21

We discuss a problem of automatic defect detection in welded joints of stainless steel pipes in the production process. Possible defects that occur during tungsten inert gas welding are shown. The substantiation of the choice of the method for solving the problem based on modeling and background subtraction is given. An algorithm for defect detection in welded joints on frames of video sequences is proposed, taking into account the features of a specific area. The background models are built using the methods of averaging and a mixture of Gaussians. Experimental studies of the algorithm are carried out using examples of processing frames of video sequences received from a static camera. The obtained results confirm that the background modeling method based on frame averaging is suitable for the automatic detection of welding defects since the defects are different and have characteristic features. The proposed algorithm makes it possible to detect and highlight the defective area in a welded joint on frames of video sequences. The experimental results show that the algorithm satisfies the requirements for continuous rapid detection of surface defects.

118-125
Semantic segmentation of rusts and spots of wheat

Authors: I.V. Arinichev, S.V. Polyanskikh, I.V. Arinicheva

Abstract

View in PDF

Number of views: 27

The paper explores the possibility of semantic segmentation of the yellow rust and wheat blotch classification using the U-Net convolutional neural network architecture. Based on an own dataset of 268 images, collected in natural conditions and in infectious nurseries of the Federal Research Center for Biological Plant Protection (VNII BZR), it is shown that the U-Net architecture with ResNet decoders is able to qualitatively detect, classify and localize rust and spotting even in cases where diseases are present on the plant at the same time. For individual classes of diseases, the main metrics (accuracy, micro-/macro precision, recall, and F1) range from 0.92 to 0.96. This indicates the possibility of recognizing even a few diseases on a leaf with an accuracy that is not inferior to that of a plant pathology expert. The IoU and Dice segmentation metrics are 0.71 and 0.88, respectively, which indicates a fairly high quality of pixel-by-pixel segmentation and is confirmed by visual analysis. The architecture of the neural network used in this case is quite lightweight, which makes it possible to use it on mobile devices without connecting to the network.

126-136
An approach to detecting and eliminating spatial contour artifacts in Web GIS applications

Authors: A.V. Vorobev, G.R. Vorobeva

Abstract

View in PDF

Number of views: 25

One of the common problems of modern geoinformation libraries and interfaces when constructing spatial isolines is the presence of multiple artifacts in the resulting set, in particular, open level lines. As a result, the formed set of spatial isolines after the web rendering procedure makes it difficult to analyze the spatial distribution of the corresponding parameters, on the one hand, and reduces the quality of spatial image rendering, on the other. At the same time, artifacts of spatial isolines are especially critical for large amounts of data. The paper proposes an approach that makes it possible to correct software-generated isolines by identifying open lines and their subsequent selective connection. From the point of view of software implementation, the presented approach practically does not change the response time of server scripts. The effectiveness of the developed approach is confirmed by the example of a web application that provides visualization in the form of a set of spatial isolines of geophysical parameters in the auroral oval region.

137-151
Central Russia heavy metal contamination model based on satellite imagery and machine learning

Authors: A. Uzhinskiy, K. Vergel

Abstract

View in PDF

Number of views: 18

Atmospheric heavy metal contamination is a real threat to human health. In this work, we examined several models trained on in situ data and indices got from satellite images. During 2018-2019, 281 samples of naturally growing mosses were collected in the Vladimir, Yaroslavl, and Moscow regions in Russia. The samples were analyzed using Neutron Activation Analysis to get the contamination levels of 18 heavy metals. The Google Earth Engine platform was used to calculate indices from satellite images that represent summarized information about sampling sites. Statistical and neural models were trained on in situ data and the indices. We focused on the classification task with 8 levels of contamination and used balancing techniques to extend the training data. Three approaches were tested: variations of gradient boosting, multilayer perceptron, and Siamese networks. All these approaches produced results with minute differences, making it difficult to judge which one is better in terms of accuracy and graphical outputs. Promising results were shown for 9 heavy metals with an overall accuracy exceeding 89%. Al, Fe, and Sb contamination was predicted for 3,000 and 12,100 grid nodes on a 500 km2 area in the Central Russia region for 2019 and 2020. The results, methods, and perspectives of the adopted approach of using satellite data together with machine learning for HM contamination prediction are presented.

152-159
Joint analysis of radiological reports and CT images for automatic validation of pathological brain conditions

Authors: Y.D. Agafonova, A.V. Gaidel, P.M. Zelter, A.V. Kapishnikov, A.V. Kuznetsov, E.N. Surovtsev, A.V. Nikonorov

Abstract

View in PDF

Number of views: 19

We consider a problem of validation of radiological medical reports and computed tomography images for an automated analysis of brain structures. Two methods for solving the problem are proposed: a method based on the ruCLIP multimodal model, and a method based on the joint use of two separate classifiers – for a text report and for a brain CT image. We discuss methods evaluation and the obtained results. The proposed approaches make it possible to correctly classify 99.6% of radiological reports from a test sampling into 15 possible diagnoses.

160-169
A new approach to training neural networks using natural gradient descent with momentum based on Dirichlet distributions

Authors: R.I. Abdulkadirov, P.A. Lyakhov

Abstract

View in PDF

Number of views: 24

In this paper, we propose a natural gradient descent algorithm with momentum based on Dirichlet distributions to speed up the training of neural networks. This approach takes into account not only the direction of the gradients, but also the convexity of the minimized function, which significantly accelerates the process of searching for the extremes. Calculations of natural gradients based on Dirichlet distributions are presented, with the proposed approach introduced into an error backpropagation scheme. The results of image recognition and time series forecasting during the experiments show that the proposed approach gives higher accuracy and does not require a large number of iterations to minimize loss functions compared to the methods of stochastic gradient descent, adaptive moment estimation and adaptive parameter-wise diagonal quasi-Newton method for nonconvex stochastic optimization.

170-178
Classification of surface defects in the base metal of pipelines based on complex diagnostics results

Authors: N.P. Aleshin, S.V. Skrynnikov, N.V. Krysko, N.A. Shchipakov, A.G. Kusyy

Abstract

View in PDF

Number of views: 24

We discuss issues of classification of operational volumetric and planar surface defects based on the results of complex diagnostics by non-destructive ultrasonic sounding using Rayleigh surface waves generated by an electromagnetic-acoustic transducer and the eddy current method. The paper presents results of feature selection using a variance analysis (ANOVA) and an Extra Trees Classifier algorithm, making it possible to select an optimal eddy current transducer for surface defect classification. The classification of surface defects by the amplitude of ultrasonic and eddy current signals, as well as the phase of the eddy current signal separately is shown to be unambiguous. Models for classifying surface defects as being volumetric or planar are constructed based on statistical methods such as Bayesian inference and the Dempster-Schafer theory. The workability of the constructed classification models is evaluated using metrics such as the Jaccard coefficient and the F1-measure.

179-184
Research on robot motion control and trajectory tracking based on agricultural seeding

Authors: L.L. Chen

Abstract

View in PDF

Number of views: 18

With the development of science and technology, agricultural production has been gradually industrialized, and the use of robots instead of humans for seeding is one of the agricultural industrializations. This paper studied the seeding path planning and path tracking algorithms of the seeding robot, carried out experiments, and compared the improved proportion, integral, differential (PID) algorithm with the traditional PID control algorithm. The results demonstrated that both the improved and non-improved control algorithms played a good role in tracking on the straight path, but the improved control algorithm had a better tracking effect on the turning path; the displacement deviation and angle deviation of the tracking trajectory of the improved PID algorithm were reduced faster and more stable than the traditional PID algorithm; the tracking trajectory was shorter and the operation time of the robot was less under the improved PID algorithm than the traditional one.

185-195
Many heads but one brain: FusionBrain – a single multimodal multitask architecture and a competition

Authors: D.D. Bakshandaeva, D.V. Dimitrov, V.S. Arkhipkin, A.V. Shonenkov, M.S. Potanin, D.K. Karachev, A.V. Kuznetsov, A.D. Voronov, A.A. Petiushko, V.F. Davydova, E.V. Tutubalina

Abstract

View in PDF

Number of views: 25

Supporting the current trend in the AI community, we present the AI Journey 2021 Challenge called FusionBrain, the first competition which is targeted to make a universal architecture which could process different modalities (in this case, images, texts, and code) and solve multiple tasks for vision and language. The FusionBrain Challenge combines the following specific tasks: Code2code Translation, Handwritten Text recognition, Zero-shot Object Detection, and Visual Question Answering. We have created datasets for each task to test the participants' submissions on it. Moreover, we have collected and made publicly available a new handwritten dataset in both English and Russian, which consists of 94,128 pairs of images and texts. We also propose a multimodal and multitask architecture – a baseline solution, in the centre of which is a frozen foundation model and which has been trained in Fusion mode along with Single-task mode. The proposed Fusion approach proves to be competitive and more energy-efficient compared to the task-specific one.