Research Outputs

Permanent URI for this communityhttps://hdl.handle.net/20.500.14288/2

Browse

Search Results

Now showing 1 - 10 of 36

Open Access
3D microprinting of iron platinum nanoparticle-based magnetic mobile microrobots
(Wiley, 2021) Giltinan, Joshua; Sridhar, Varun; Bozüyük, Uğur; Sheehan, Devin; Department of Mechanical Engineering; Sitti, Metin; Faculty Member; Department of Mechanical Engineering; School of Medicine; College of Engineering; 297104
Wireless magnetic microrobots are envisioned to revolutionize minimally invasive medicine. While many promising medical magnetic microrobots are proposed, the ones using hard magnetic materials are not mostly biocompatible, and the ones using biocompatible soft magnetic nanoparticles are magnetically very weak and, therefore, difficult to actuate. Thus, biocompatible hard magnetic micro/nanomaterials are essential toward easy-to-actuate and clinically viable 3D medical microrobots. To fill such crucial gap, this study proposes ferromagnetic and biocompatible iron platinum (FePt) nanoparticle-based 3D microprinting of microrobots using the two-photon polymerization technique. A modified one-pot synthesis method is presented for producing FePt nanoparticles in large volumes and 3D printing of helical microswimmers made from biocompatible trimethylolpropane ethoxylate triacrylate (PETA) polymer with embedded FePt nanoparticles. The 30 mu m long helical magnetic microswimmers are able to swim at speeds of over five body lengths per second at 200Hz, making them the fastest helical swimmer in the tens of micrometer length scale at the corresponding low-magnitude actuation fields of 5-10mT. It is also experimentally in vitro verified that the synthesized FePt nanoparticles are biocompatible. Thus, such 3D-printed microrobots are biocompatible and easy to actuate toward creating clinically viable future medical microrobots.
Metadata only
3D model retrieval using probability density-based shape descriptors
(IEEE Computer Society, 2009) Akgul, Ceyhun Burak; Sankur, Buelent; Schmitt, Francis; Department of Computer Engineering; Yemez, Yücel; Faculty Member; Department of Computer Engineering; College of Engineering; 107907
We address content-based retrieval of complete 3D object models by a probabilistic generative description of local shape properties. The proposed shape description framework characterizes a 3D object with sampled multivariate probability density functions of its local surface features. This density-based descriptor can be efficiently computed via kernel density estimation (KDE) coupled with fast Gauss transform. The nonparametric KDE technique allows reliable characterization of a diverse set of shapes and yields descriptors which remain relatively insensitive to small shape perturbations and mesh resolution. Density-based characterization also induces a permutation property which can be used to guarantee invariance at the shape matching stage. As proven by extensive retrieval experiments on several 3D databases, our framework provides state-of-the-art discrimination over a broad and heterogeneous set of shape categories.
Metadata only
A deterministic analysis of an online convex mixture of experts algorithm
(Institute of Electrical and Electronics Engineers (IEEE), 2015) Özkan, Hüseyin; Dönmez, Mehmet A.; N/A; Tunç, Sait; Master Student; Graduate School of Sciences and Engineering; N/A
We analyze an online learning algorithm that adaptively combines outputs of two constituent algorithms (or the experts) running in parallel to estimate an unknown desired signal. This online learning algorithm is shown to achieve and in some cases outperform the mean-square error (MSE) performance of the best constituent algorithm in the steady state. However, the MSE analysis of this algorithm in the literature uses approximations and relies on statistical models on the underlying signals. Hence, such an analysis may not be useful or valid for signals generated by various real-life systems that show high degrees of nonstationarity, limit cycles and that are even chaotic in many cases. In this brief, we produce results in an individual sequence manner. In particular, we relate the time-accumulated squared estimation error of this online algorithm at any time over any interval to the one of the optimal convex mixture of the constituent algorithms directly tuned to the underlying signal in a deterministic sense without any statistical assumptions. In this sense, our analysis provides the transient, steady-state, and tracking behavior of this algorithm in a strong sense without any approximations in the derivations or statistical assumptions on the underlying signals such that our results are guaranteed to hold. We illustrate the introduced results through examples.
Metadata only
A discrete-continuous optimization approach for the design and operation of synchromodal transportation networks
(Elsevier, 2019) Reşat, Hamdi Giray; Department of Industrial Engineering; Türkay, Metin; Faculty Member; Department of Industrial Engineering; College of Engineering; 24956
This paper presents a multi-objective mixed-integer programming problem for integrating specific characteristics of synchromodal transportation. The problem includes different objective functions including total transportation cost, travel time and CO2 emissions while optimizing the proposed network structure. Traffic congestion, time-dependent vehicle speeds and vehicle filling ratios are considered and computational results for different illustrative cases are presented with real data from the Marmara Region of Turkey. The defined non-linear model is converted into linear form and solved by using a customized implementation of the e-constraint method. Then, the sensitivity analysis of proposed mathematical models with pre-processing constraints is summarized for decision makers.
Metadata only
AffectON: Incorporating affect into dialog generation
(IEEE-Inst Electrical Electronics Engineers Inc, 2023) Bucinca, Zana; Department of Computer Engineering; Yemez, Yücel; Erzin, Engin; Sezgin, Tevfik Metin; Department of Computer Engineering; Koç Üniversitesi İş Bankası Yapay Zeka Uygulama ve Araştırma Merkezi (KUIS AI)/ Koç University İş Bank Artificial Intelligence Center (KUIS AI); College of Engineering
Due to its expressivity, natural language is paramount for explicit and implicit affective state communication among humans. The same linguistic inquiry (e.g., How are you?) might induce responses with different affects depending on the affective state of the conversational partner(s) and the context of the conversation. Yet, most dialog systems do not consider affect as constitutive aspect of response generation. In this article, we introduce AffectON, an approach for generating affective responses during inference. For generating language in a targeted affect, our approach leverages a probabilistic language model and an affective space. AffectON is language model agnostic, since it can work with probabilities generated by any language model (e.g., sequence-to-sequence models, neural language models, n-grams). Hence, it can be employed for both affective dialog and affective language generation. We experimented with affective dialog generation and evaluated the generated text objectively and subjectively. For the subjective part of the evaluation, we designed a custom user interface for rating and provided recommendations for the design of such interfaces. The results, both subjective and objective demonstrate that our approach is successful in pulling the generated language toward the targeted affect, with little sacrifice in syntactic coherence.
Metadata only
An extended formulation of moldable task scheduling problem and its application to quay crane assignments
(Pergamon-Elsevier Science Ltd, 2021) Ünsal, Özgür; PhD Student; Graduate School of Sciences and Engineering; N/A
In this paper, we study an extended formulation of moldable task scheduling problem (MTSP) motivated by the assignments of quay cranes to vessels. In container terminals, handling time of a vessel depends on the number of quay cranes assigned to that vessel. This characteristic allows us to model quay crane assignment problem (QCAP) as a variant of MTSP. By considering the modeling requirements of various properties of QCAP, we develop an extended formulation of MTSP with specific task to machine assignments. Even though this formulation brings modeling flexibility, it can only be solved for small instances because of its size. For this reason, we provide a generic solution algorithm based on a logic based Benders decomposition by utilizing the extended formulation. There are various characteristics of QCAP observed in different terminals. Accordingly, we implement the proposed decomposition algorithm for contiguous assignments of QCs, uniform QCs as well as the availability of QCs. Computational experiments show that the proposed algorithm is able to solve instances of considerable sizes to optimality and provides a modeling flexibility that allows implementation to different terminal settings.
Metadata only
Analysis of head gesture and prosody patterns for prosody-driven head-gesture animation
(IEEE Computer Soc, 2008) Sargin, Mehmet Emre; Department of Computer Engineering; Department of Computer Engineering; Department of Electrical and Electronics Engineering; Yemez, Yücel; Erzin, Engin; Tekalp, Ahmet Murat; Faculty Member; Faculty Member; Faculty Member; Department of Computer Engineering; Department of Electrical and Electronics Engineering; College of Engineering; College of Engineering; College of Engineering; 107907; 34503; 26207
We propose a new two-stage framework for joint analysis of head gesture and speech prosody patterns of a speaker toward automatic realistic synthesis of head gestures from speech prosody. In the first stage analysis, we perform Hidden Markov Model (HMM)-based unsupervised temporal segmentation of head gesture and speech prosody features separately to determine elementary head gesture and speech prosody patterns, respectively, for a particular speaker. In the second stage, joint analysis of correlations between these elementary head gesture and prosody patterns is performed using Multistream HMMs to determine an audio-visual mapping model. The resulting audio-visual mapping model is then employed to synthesize natural head gestures from arbitrary input test speech given a head model for the speaker. In the synthesis stage, the audio-visual mapping model is used to predict a sequence of gesture patterns from the prosody pattern sequence computed for the input test speech. The Euler angles associated with each gesture pattern are then applied to animate the speaker head model. Objective and subjective evaluations indicate that the proposed synthesis by analysis scheme provides natural looking head gestures for the speaker with any input test speech, as well as in "prosody transplant" and "gesture transplant" scenarios.
Metadata only
Convolutive bounded component analysis algorithms for independent and dependent source separation
(IEEE-inst Electrical Electronics Engineers inc, 2015) N/A; N/A; Department of Electrical and Electronics Engineering; İnan, Hüseyin Atahan; Erdoğan, Alper Tunga; Master Student; Faculty Member; Department of Electrical and Electronics Engineering; Graduate School of Sciences and Engineering; College of Engineering; N/A; 41624
Bounded component analysis (BCa) is a framework that can be considered as a more general framework than independent component analysis (ICa) under the boundedness constraint on sources. Using this framework, it is possible to separate dependent as well as independent components from their mixtures. in this paper, As an extension of a recently introduced instantaneous BCa approach, we introduce a family of convolutive BCa criteria and corresponding algorithms. We prove that the global optima of the proposed criteria, under generic BCa assumptions, Are equivalent to a set of perfect separators. the algorithms introduced in this paper are capable of separating not only the independent sources but also the sources that are dependent/correlated in both component (space) and sample (time) dimensions. therefore, under the condition that the sources are bounded, they can be considered as extended convolutive ICa algorithms with additional dependent/correlated source separation capability. Furthermore, they have potential to provide improvement in separation performance, especially for short data records. This paper offers examples to illustrate the space-time correlated source separation capability through a copula distribution-based example. in addition, A frequency-selective Multiple input Multiple Output equalization example demonstrates the clear performance advantage of the proposed BCa approach over the state-of-the-art ICa-based approaches in setups involving convolutive mixtures of digital communication sources.
Open Access
Emotion dependent domain adaptation for speech driven affective facial feature synthesis
(Institute of Electrical and Electronics Engineers (IEEE), 2022) Department of Electrical and Electronics Engineering; Erzin, Engin; Sadiq, Rizwan; Faculty Member; Department of Electrical and Electronics Engineering; Koç Üniversitesi İş Bankası Yapay Zeka Uygulama ve Araştırma Merkezi (KUIS AI)/ Koç University İş Bank Artificial Intelligence Center (KUIS AI); College of Engineering; 34503; N/A
Although speech driven facial animation has been studied extensively in the literature, works focusing on the affective content of the speech are limited. This is mostly due to the scarcity of affective audio-visual data. In this article, we improve the affective facial animation using domain adaptation by partially reducing the data scarcity. We first define a domain adaptation to map affective and neutral speech representations to a common latent space in which cross-domain bias is smaller. Then the domain adaptation is used to augment affective representations for each emotion category, including angry, disgust, fear, happy, sad, surprise, and neutral, so that we can better train emotion-dependent deep audio-to-visual (A2V) mapping models. Based on the emotion-dependent deep A2V models, the proposed affective facial synthesis system is realized in two stages: first, speech emotion recognition extracts soft emotion category likelihoods for the utterances; then a soft fusion of the emotion-dependent A2V mapping outputs form the affective facial synthesis. Experimental evaluations are performed on the SAVEE audio-visual dataset. The proposed models are assessed with objective and subjective evaluations. The proposed affective A2V system achieves significant MSE loss improvements in comparison to the recent literature. Furthermore, the resulting facial animations of the proposed system are preferred over the baseline animations in the subjective evaluations.
Open Access
End to end rate distortion optimized learned hierarchical bi-directional video compression
(Institute of Electrical and Electronics Engineers (IEEE), 2022) Department of Electrical and Electronics Engineering; Tekalp, Ahmet Murat; Yılmaz, Mustafa Akın; Faculty Member; Department of Electrical and Electronics Engineering; College of Engineering; 26207; N/A
Conventional video compression (VC) methods are based on motion compensated transform coding, and the steps of motion estimation, mode and quantization parameter selection, and entropy coding are optimized individually due to the combinatorial nature of the end-to-end optimization problem. Learned VC allows end-to-end rate-distortion (R-D) optimized training of nonlinear transform, motion and entropy model simultaneously. Most works on learned VC consider end-to-end optimization of a sequential video codec based on R-D loss averaged over pairs of successive frames. It is well-known in conventional VC that hierarchical, bi-directional coding outperforms sequential compression because of its ability to use both past and future reference frames. This paper proposes a learned hierarchical bi-directional video codec (LHBDC) that combines the benefits of hierarchical motion-compensated prediction and end-to-end optimization. Experimental results show that we achieve the best R-D results that are reported for learned VC schemes to date in both PSNR and MS-SSIM. Compared to conventional video codecs, the R-D performance of our end-to-end optimized codec outperforms those of both x265 and SVT-HEVC encoders ("veryslow" preset) in PSNR and MS-SSIM as well as HM 16.23 reference software in MS-SSIM. We present ablation studies showing performance gains due to proposed novel tools such as learned masking, flow-field subsampling, and temporal flow vector prediction. The models and instructions to reproduce our results can be found in https://github.com/makinyilmaz/LHBDC/.

Research Outputs

Browse

Filters

Advanced Search

Filter by

Settings

Sort By

Results per page

Search Results