Publications without Fulltext

Permanent URI for this collectionhttps://hdl.handle.net/20.500.14288/3

Browse

Search Results

Now showing 1 - 10 of 260
  • Placeholder
    Publication
    Performance measures for video object segmentation and tracking
    (IEEE-Inst Electrical Electronics Engineers Inc, 2004) Erdem, Çiğdem Eroğlu; Sankur, Bülent; Department of Electrical and Electronics Engineering; Tekalp, Ahmet Murat; Faculty Member; Department of Electrical and Electronics Engineering; College of Engineering; 26207
    We propose measures to evaluate quantitatively the performance of video object segmentation and tracking methods without ground-truth (GT) segmentation maps. The proposed measures are based on spatial differences of color and motion along the boundary of the estimated video object plane and temporal differences between the color histogram of the current object plane and its predecessors. They can be used to localize (spatially and/or temporally) regions where segmentation results are good or bad; and/or they can be combined to yield a single numerical measure to indicate the goodness of the boundary segmentation and tracking results over a sequence. The validity of the proposed performance measures without GT have been demonstrated by canonical correlation analysis with another set of measures with GT on a set of sequences (where GT information is available). Experimental results are presented to evaluate the segmentation maps obtained from various sequences using different segmentation approaches.
  • Placeholder
    Publication
    Exploring projection based mixed reality with tangibles for nonsymbolic preschool math education
    (Assoc Computing Machinery, 2019) N/A; N/A; Department of Psychology; Department of Media and Visual Arts; Department of Electrical and Electronics Engineering; Salman, Elif; Beşevli, Ceylan; Göksun, Tilbe; Özcan, Oğuzhan; Ürey, Hakan; Master Student; Researcher; Faculty Member; Faculty Member; Faculty Member; Department of Psychology; Department of Media and Visual Arts; Department of Electrical and Electronics Engineering; Graduate School of Sciences and Engineering; Graduate School of Social Sciences and Humanities; College of Social Sciences and Humanities; College of Social Sciences and Humanities; College of Engineering; N/A; N/A; 47278; 12532; 8579
    A child's early math development can stem from interactions with the physical world. Accordingly, current tangible interaction studies focus on preschool children's formal (symbolic) mathematics, i.e. number knowledge. However, recent developmental studies stress the importance of nonsymbolic number representation in math learning, i.e. understanding quantity relations without counting(more/less). To our knowledge, there are no tangible systems based on this math concept. We developed an initial tangible based mixed-reality(MR) setup with a small tabletop projector and depth camera. Our goal was observing children's interaction with the setup to guide our further design process towards developing nonsymbolic math trainings. In this paper we present our observations from sessions with four 3-to-5 year old children and discuss their meaning for future work. Initial clues show that our MR setup leads to exploratory and mindful interactions, which might be generalizable to other tangible MR systems for child education and could inspire interaction design studies.
  • Placeholder
    Publication
    Robust speech recognition using adaptively denoised wavelet coefficients
    (IEEE, 2004) Department of Electrical and Electronics Engineering; Department of Electrical and Electronics Engineering; N/A; Tekalp, Ahmet Murat; Erzin, Engin; Akyol, Emrah; Faculty Member; Faculty Member; Master Student; Department of Electrical and Electronics Engineering; College of Engineering; College of Engineering; Graduate School of Sciences and Engineering; 26207; 34503; N/A
    The existence of additive noise affects the performance of speech recognition in real environments. We propose a new set of feature vectors for robust speech recognition using denoised wavelet coefficients. The use of wavelet coefficients in speech processing is motivated by the ability of the wavelet transform to capture both time and frequency information and the non-stationary behaviour of speech signals. We use one set of noisy data, such as data with car noise, and we use hard thresholding in the best basis for denoising. We use isolated digits as our database in our HMM based speech recognition system. A performance comparison of hard thresholding denoised wavelet coefficients and MFCC feature vectors is presented.
  • Placeholder
    Publication
    Multimodal analysis of speech prosody and upper body gestures using hidden semi-Markov models
    (Institute of Electrical and Electronics Engineers (IEEE), 2013) N/A; N/A; N/A; Department of Computer Engineering; Department of Computer Engineering; Bozkurt, Elif; Asta, Shahriar; Özkul, Serkan; Yemez, Yücel; Erzin, Engin; PhD Student; PhD Student; Master Student; Faculty Member; Faculty Member; Department of Computer Engineering; Graduate School of Sciences and Engineering; Graduate School of Sciences and Engineering; Graduate School of Sciences and Engineering; College of Engineering; College of Engineering; N/A; N/A; N/A; 107907; 34503
    Gesticulation is an essential component of face-to-face communication, and it contributes significantly to the natural and affective perception of human-to-human communication. In this work we investigate a new multimodal analysis framework to model relationships between intonational and gesture phrases using the hidden semi-Markov models (HSMMs). The HSMM framework effectively associates longer duration gesture phrases to shorter duration prosody clusters, while maintaining realistic gesture phrase duration statistics. We evaluate the multimodal analysis framework by generating speech prosody driven gesture animation, and employing both subjective and objective metrics.
  • Placeholder
    Publication
    Optical modulation with silicon microspheres
    (IEEE-Inst Electrical Electronics Engineers Inc, 2009) Gürlü, Oğuzhan; N/A; Department of Physics; Yüce, Emre; Serpengüzel, Ali; Master Student; Faculty Member; Department of Physics; Graduate School of Sciences and Engineering; College of Sciences; 245435; 27855
    In this letter, a silicon microsphere coupled to a silica optical fiber half coupler has been characterized for electrooptical modulation in the L-band at 1.55 mu m. Electrooptical modulation of the transmitted and the 90 degrees elastic scattered signals for both the TE and the TM polarizations of the microsphere resonances has been observed.
  • Placeholder
    Publication
    Mid-infrared elastic scattering from germanium microspheres
    (IEEE, 2016) N/A; N/A; N/A; N/A; Department of Physics; Zakwan, Muhammad; Bayer, Mustafa Mert; Anwar, Muhammad Sohail; Gökay, Ulaş Sabahattin; Serpengüzel, Ali; PhD Student; Master Student; Master Student; PhD Student; Faculty Member; Department of Physics; Graduate School of Sciences and Engineering; Graduate School of Sciences and Engineering; Graduate School of Sciences and Engineering; Graduate School of Sciences and Engineering; College of Sciences; N/A; N/A; N/A; N/A; 27855
    Because of their ultrahigh optical nonlinearities and extremely broad transparency window, germanium microsphere resonators offer the potential for optical processing devices, especially in the mid-infrared (mid-IR) wavelengths. As a semiconductor material for microphotonics applications [1], germanium is particularly attractive owing to its large nonlinearity, high optical damage threshold compared with traditional nonlinear glass materials, and above all, its broad transparency window, extending from the near-IR into the mid-IR. Germanium based optical components have found numerous applications in imaging systems operating in the mid-IR wavelengths, where the principal natural greenhouse gases do not exhibit strong absorption. These applications include rapid sensing and diagnosis [2,] [3], industrial process controls, environmental monitors to hazardous chemical detection [4]. Germanium also is a good electromagnetic shielding material, an attribute that has become increasingly important for modern military applications, where other signals (within the millimeter and centimeter wavelength range) can be strong enough to interfere with nearby IR systems. Elastic light scattering from a germanium microsphere has already been observed in the near-IR [5]. Here, elastic light scattering from a germanium microsphere in the mid-IR region is numerically analyzed using generalized Lorenz-Mie theory (GLMT) [6]. Light interaction with microspheres of various materials is of much interest because of their photonic properties [7]. Germanium has a refractive index of 4, which is even higher than the refractive index of silicon (3.5) in the mid-IR region. The higher refractive index results in higher quality factor morphology dependent resonances (MDRs). A higher value of Q indicates a longer lifetime of the photons trapped inside the cavity and a narrower MDR. Here, the MDRs are observed numerically in the transverse magnetically (TM) and transverse electrically (TE) polarized 90° elastic scattering and 0° transmission for a 40 µm radius germanium microsphere in the mid-IR wavelengths ranging from 5.4 µm to 5.6 µm [8]. The mode spacing of approximately 41 nm between the resonances with the same radial mode order and consecutive polar mode number shows good correlation with the optical size of the germanium microsphere. The germanium microsphere with its high quality factor MDRs can be suitable for optical monitoring and sensing applications in the mid-IR, which require a high spectral resolution [9].
  • Placeholder
    Publication
    Ruby microsphere and liquid cyrstal based tunable optical filter
    (Ieee, 2009) Beccherelli, Romeo; Department of Physics; N/A; N/A; Serpengüzel, Ali; Murib, Mohammed Sharif; Hüseyinoğlu, Ersin; Faculty Member; PhD Student; Master Student; Department of Physics; College of Sciences; Graduate School of Sciences and Engineering; Graduate School of Sciences and Engineering; 27855; N/A
    A microsphere placed in a liquid crystal is used as an optical filter. By changing the refractive index of the liquid cyrstal, the resonance frequency of the sphere is controlled.
  • Placeholder
    Publication
    Optimal rate and input format control for content and context adaptive video streaming
    (IEEE, 2004) Department of Electrical and Electronics Engineering; Department of Electrical and Electronics Engineering; N/A; Tekalp, Ahmet Murat; Civanlar, Mehmet Reha; Özçelebi, Tanır; Faculty Member; Faculty Member; PhD Student; Department of Electrical and Electronics Engineering; College of Engineering; College of Engineering; Graduate School of Sciences and Engineering; 26207; 16372; N/A
    A novel dynamic programming based technique for optimal selection of input video format and compression rate for video streaming based on "relevancy" of the content and user context is presented. The technique uses context dependent content analysis to divide the input video into temporal segments. User selected relevance levels assigned to these segments are used in formulating a constrained optimization problem, which is solved using dynamic programming. The technique minimizes a weighted distortion measure and the initial waiting time for continuous playback under maximum acceptable distortion constraints. Spatial resolution and frame rate of input video and the DCT quantization parameters are used as optimization variables. The technique is applied to encoding of soccer videos using an H.264 [1] encoder. The improvements obtained over a standard H.264 implementation are demonstrated by experimental results.
  • Placeholder
    Publication
    Artificial bandwidth extension of speech excitation
    (IEEE, 2015) Department of Computer Engineering; N/A; Erzin, Engin; Turan, Mehmet Ali Tuğtekin; Faculty Member; PhD Student; Department of Computer Engineering; College of Engineering; Graduate School of Sciences and Engineering; 34503; N/A
    In this paper, a new approach that extends narrowband excitation signals to synthesize wide-band speech have been proposed. Bandwidth extension problem is analyzed using source-filter separation framework where a speech signal is decomposed into two independent components. For spectral envelope extension, our former work based on hidden Markov model have been used. For excitation signal extension, the proposed method moves the spectrum based on correlation analysis where the distance between the harmonics and the structure of the excitation signal are preserved in high-bands. In experimental studies, we also apply two other well-known extension techniques for excitation signals comparatively and evaluate the overall performance of proposed system using the PESQ metric. Our findings indicate that the proposed extension method outperforms other two techniques. © 2015 IEEE./ Öz: Bu çalışmada dar bantlı kaynak sinyallerinin bant genişliği artırılarak geniş bantlı konuşma sentezleyen yeni bir yaklaşım önerilmektedir. Bant genişletme problemi kaynak süzgeç analizinin yardımıyla iki bağımsız bileşen üzerinde ayrı ayrı ele alınmıştır. Süzgeç yapısını şekillendiren izgesel zarfı, saklı Markov modeli tabanlı geçmiş çalışmamızı kullanarak iyileştirirken, dar bantlı kaynak sinyalinin genişletilmesi için izgesel kopyalamaya dayalı yeni bir yöntem öneriyoruz. Bu yeni yöntemde dar bantlı kaynak sinyalinin yüksek frekans bileşenlerindeki harmonik yapısını, ilinti analizi ile genişletip geniş bantlı kaynak sinyali sentezlemekteyiz. Öne sürülen bu iyileştirmenin başarımını ölçebilmek için literatürde sıklıkla kullanılan iki ayrı genişletme yöntemi de karşılaştırmalı olarak degerlendirilmekte- dir. Deneysel çalışmalarda öne sürdüğümüz genişletmenin PESQ ölçütüyle nesnel başarımı gösterilmiştir.
  • Placeholder
    Publication
    An extended family of bounded component analysis algorithms
    (IEEE Computer Society, 2015) Department of Electrical and Electronics Engineering; N/A; Erdoğan, Alper Tunga; İnan, Hüseyin Atahan; Faculty Member; Master Student; Department of Electrical and Electronics Engineering; College of Engineering; Graduate School of Sciences and Engineering; 41624; N/A
    Bounded Component Analysis (BCA) is a recent concept proposed as an alternative method for Blind Source Separation problem. BCA enables the separation of dependent as well as independent sources from their mixtures under the practical assumption on source boundedness. This article extends the optimization setting of a recent BCA approach which can be used to produce a variety of BCA algorithms. The article also provides examples of objective functions and the corresponding iterative algorithms. The numerical examples illustrate the advantages of proposed BCA examples regarding the correlated source separation capability over the state of the art ICA based approaches. 1 © 2014 IEEE.