skip to main content
10.1145/1553374acmotherconferencesBook PagePublication PagesicmlConference Proceedingsconference-collections
ICML '09: Proceedings of the 26th Annual International Conference on Machine Learning
ACM2009 Proceeding
Publisher:
  • Association for Computing Machinery
  • New York
  • NY
  • United States
Conference:
ICML '09: The 26th Annual International Conference on Machine Learning held in conjunction with the 2007 International Conference on Inductive Logic Programming Montreal Quebec Canada June 14 - 18, 2009
ISBN:
978-1-60558-516-1
Published:
14 June 2009
Sponsors:
NSF, Microsoft Research, MITACS

Bibliometrics
research-article
Archipelago: nonparametric Bayesian semi-supervised learning

Semi-supervised learning (SSL), is classification where additional unlabeled data can be used to improve accuracy. Generative approaches are appealing in this situation, as a model of the data's probability density can assist in identifying clusters. ...

research-article
Tractable nonparametric Bayesian inference in Poisson processes with Gaussian process intensities

The inhomogeneous Poisson process is a point process that has varying intensity across its domain (usually time or space). For nonparametric Bayesian modeling, the Gaussian process is a useful way to place a prior distribution on this intensity. The ...

research-article
Route kernels for trees

Almost all tree kernels proposed in the literature match substructures without taking into account their relative positioning with respect to one another. In this paper, we propose a novel family of kernels which explicitly focus on this type of ...

research-article
Incorporating domain knowledge into topic modeling via Dirichlet Forest priors

Users of topic modeling methods often have knowledge about the composition of words that should have high or low probability in various topics. We incorporate such domain knowledge using a novel Dirichlet Forest prior in a Latent Dirichlet Allocation ...

research-article
Grammatical inference as a principal component analysis problem

One of the main problems in probabilistic grammatical inference consists in inferring a stochastic language, i.e. a probability distribution, in some class of probabilistic models, from a sample of strings independently drawn according to a fixed ...

research-article
Curriculum learning

Humans and animals learn much better when the examples are not randomly presented but organized in a meaningful order which illustrates gradually more concepts, and gradually more complex ones. Here, we formalize such training strategies in the context ...

research-article
Importance weighted active learning

We present a practical and statistically consistent scheme for actively learning binary classifiers under general loss functions. Our algorithm uses importance weighting to correct sampling bias, and by controlling the variance, we are able to give ...

research-article
Split variational inference

We propose a deterministic method to evaluate the integral of a positive function based on soft-binning functions that smoothly cut the integral into smaller integrals that are easier to approximate. In combination with mean-field approximations for ...

research-article
Predictive representations for policy gradient in POMDPs

We consider the problem of estimating the policy gradient in Partially Observable Markov Decision Processes (POMDPs) with a special class of policies that are based on Predictive State Representations (PSRs). We compare PSR policies to Finite-State ...

research-article
Online feature elicitation in interactive optimization

Most models of utility elicitation in decision support and interactive optimization assume a predefined set of "catalog" features over which user preferences are expressed. However, users may differ in the features over which they are most comfortable ...

research-article
Spectral clustering based on the graph p-Laplacian

We present a generalized version of spectral clustering using the graph p-Laplacian, a nonlinear generalization of the standard graph Laplacian. We show that the second eigenvector of the graph p-Laplacian interpolates between a relaxation of the ...

research-article
Active learning for directed exploration of complex systems

Physics-based simulation codes are widely used in science and engineering to model complex systems that would be infeasible to study otherwise. Such codes provide the highest-fidelity representation of system behavior, but are often so slow to run that ...

research-article
Optimized expected information gain for nonlinear dynamical systems

This paper addresses the problem of active model selection for nonlinear dynamical systems. We propose a novel learning approach that selects the most informative subset of time-dependent variables for the purpose of Bayesian model inference. The model ...

research-article
Probabilistic dyadic data analysis with local and global consistency

Dyadic data arises in many real world applications such as social network analysis and information retrieval. In order to discover the underlying or hidden structure in the dyadic data, many topic modeling techniques were proposed. The typical ...

research-article
Structure learning of Bayesian networks using constraints

This paper addresses exact learning of Bayesian network structure from data and expert's knowledge based on score functions that are decomposable. First, it describes useful properties that strongly reduce the time and memory costs of many known methods ...

research-article
Robust bounds for classification via selective sampling

We introduce a new algorithm for binary classification in the selective sampling protocol. Our algorithm uses Regularized Least Squares (RLS) as base classifier, and for this reason it can be efficiently run in any RKHS. Unlike previous margin-based ...

research-article
Multi-view clustering via canonical correlation analysis

Clustering data in high dimensions is believed to be a hard problem in general. A number of efficient clustering algorithms developed in recent years address this problem by projecting the data into a lower-dimensional subspace, e.g. via Principal ...

research-article
A convex formulation for learning shared structures from multiple tasks

Multi-task learning (MTL) aims to improve generalization performance by learning multiple related tasks simultaneously. In this paper, we consider the problem of learning shared structures from multiple related tasks. We present an improved formulation (...

research-article
Learning kernels from indefinite similarities

Similarity measures in many real applications generate indefinite similarity matrices. In this paper, we consider the problem of classification based on such indefinite similarities. These indefinite kernels can be problematic for standard kernel-based ...

research-article
Matrix updates for perceptron training of continuous density hidden Markov models

In this paper, we investigate a simple, mistake-driven learning algorithm for discriminative training of continuous density hidden Markov models (CD-HMMs). Most CD-HMMs for automatic speech recognition use multivariate Gaussian emission densities (or ...

research-article
Decision tree and instance-based learning for label ranking

The label ranking problem consists of learning a model that maps instances to total orders over a finite set of predefined labels. This paper introduces new methods for label ranking that complement and improve upon existing approaches. More ...

research-article
Learning dictionaries of stable autoregressive models for audio scene analysis

In this paper, we explore an application of basis pursuit to audio scene analysis. The goal of our work is to detect when certain sounds are present in a mixed audio signal. We focus on the regime where out of a large number of possible sources, a small ...

research-article
Exploiting sparse Markov and covariance structure in multiresolution models

We consider Gaussian multiresolution (MR) models in which coarser, hidden variables serve to capture statistical dependencies among the finest scale variables. Tree-structured MR models have limited modeling capabilities, as variables at one scale are ...

research-article
Nonparametric estimation of the precision-recall curve

The Precision-Recall (PR) curve is a widely used visual tool to evaluate the performance of scoring functions in regards to their capacities to discriminate between two populations. The purpose of this paper is to examine both theoretical and practical ...

research-article
EigenTransfer: a unified framework for transfer learning

This paper proposes a general framework, called EigenTransfer, to tackle a variety of transfer learning problems, e.g. cross-domain learning, self-taught learning, etc. Our basic idea is to construct a graph to represent the target transfer learning ...

research-article
Fitting a graph to vector data

We introduce a measure of how well a combinatorial graph fits a collection of vectors. The optimal graphs under this measure may be computed by solving convex quadratic programs and have many interesting properties. For vectors in d dimensional space, ...

research-article
Unsupervised search-based structured prediction

We describe an adaptation and application of a search-based structured prediction algorithm "Searn" to unsupervised learning problems. We show that it is possible to reduce unsupervised learning to supervised learning and demonstrate a high-quality un-...

research-article
Deep transfer via second-order Markov logic

Standard inductive learning requires that training and test instances come from the same distribution. Transfer learning seeks to remove this restriction. In shallow transfer, test instances are from the same domain, but have a different distribution. ...

research-article
Analytic moment-based Gaussian process filtering

We propose an analytic moment-based filter for nonlinear stochastic dynamic systems modeled by Gaussian processes. Exact expressions for the expected value and the covariance matrix are provided for both the prediction step and the filter step, where an ...

research-article
Good learners for evil teachers

We consider a supervised machine learning scenario where labels are provided by a heterogeneous set of teachers, some of which are mediocre, incompetent, or perhaps even malicious. We present an algorithm, built on the SVM framework, that explicitly ...

Cited By

  1. ACM
    Han C, Castells P, Gupta P, Xu X and Salaka V Addressing Cold Start in Product Search via Empirical Bayes Proceedings of the 31st ACM International Conference on Information & Knowledge Management, (3141-3151)
  2. ACM
    Ma X, Guo J, Zhang R, Fan Y and Cheng X Pre-train a Discriminative Text Encoder for Dense Retrieval via Contrastive Span Prediction Proceedings of the 45th International ACM SIGIR Conference on Research and Development in Information Retrieval, (848-858)
  3. Dougherty K, Melkonian V and Montenegro G (2021). Artificial intelligence in polyp detection - where are we and where are we headed?, Artificial Intelligence in Gastrointestinal Endoscopy, 10.37126/aige.v2.i6.211, 2:6, (211-219), Online publication date: 28-Dec-2022.
  4. Deng X, Sun H, Xu G, Yu Y, Wang H, Nakashima S, Mu S and Lu H (2021). RLGC: residual low rank group sparsity constraint for image denoising International Symposium on Artificial Intelligence and Robotics 2021, 10.1117/12.2605808, 9781510646124, (77)
  5. Espinosa-Anke L, Declerck T, Gromann D, Vilalta A, Garcia-Gasulla D, Parés F, Ayguadé E, Labarta J, Moya-Sánchez E, Cortés U, Gromann D, Espinosa Anke L and Declerck T (2019). Studying the impact of the Full-Network embedding on multimodal pipelines, Semantic Web, 10:5, (909-923), Online publication date: 1-Jan-2019.
  6. Wang Y, Pham T, Vozenilek V, Zhang D, Xie Y, Zhang D, Zhang D, Li Y and Wang W (2017). Multi-scales region segmentation for ROI separation in digital mammograms Eighth International Conference on Graphic and Image Processing, 10.1117/12.2267046, , (1022515), Online publication date: 8-Feb-2017.
  7. Ternovskiy I, Chin P, Yu K, Harang R and Wood K (2017). Machine learning for intrusion detection in mobile tactical networks SPIE Defense + Security, 10.1117/12.2261683, , (1018504), Online publication date: 1-May-2017.
  8. Falco C, Jiang X, Liu B, Kong L, Zhao J, Wu J and Tan Z (2016). Towards 3D object recognition with contractive autoencoders Eighth International Conference on Digital Image Processing (ICDIP 2016), 10.1117/12.2243988, , (100330S), Online publication date: 29-Aug-2016.
  9. Kadar I, Flenner A, Culp M, McGee R, Flenner J and Garcia-Cardona C (2015). Learning representations for improved target identification, scene classification, and information fusion SPIE Defense + Security, 10.1117/12.2176348, , (94740W), Online publication date: 21-May-2015.
  10. Ahmad F, Zhang Y and Wang B (2015). Group sparsity based spectrum estimation of harmonic speech signals SPIE Sensing Technology + Applications, 10.1117/12.2180327, , (94840N), Online publication date: 19-May-2015.
  11. Blowers M, Williams J, Payer G, McCormick C and Harang R (2014). Applying hardware-based machine learning to signature-based network intrusion detection SPIE Sensing Technology + Applications, 10.1117/12.2052548, , (91190C), Online publication date: 22-May-2014.
  12. Fuqiang C, Yan W, Yude B and Guodong Z (2014). Spectral Classification Using Restricted Boltzmann Machine, Publications of the Astronomical Society of Australia, 10.1017/pasa.2013.38, 31,
  13. Ternovskiy I, Chin P, Payer G, McCormick C and Harang R (2014). Applying hardware-based machine learning to signature-based network intrusion detection SPIE Defense + Security, 10.1117/12.2049890, , (909702), Online publication date: 18-Jun-2014.
  14. Vuksanovic B, Zhou J, Verikas A, Liu B and Fan H (2013). Semantic labeling of indoor scenes from RGB-D images with discriminative learning Sixth International Conference on Machine Vision (ICMV 13), 10.1117/12.2049805, , (90670C), Online publication date: 24-Dec-2013.
  15. Liu B (2012). Sentiment Analysis and Opinion Mining, Synthesis Lectures on Human Language Technologies, 10.2200/S00416ED1V01Y201204HLT016, 5:1, (1-167), Online publication date: 23-May-2012.
  16. Szepesvári C (2010). Algorithms for Reinforcement Learning, Synthesis Lectures on Artificial Intelligence and Machine Learning, 10.2200/S00268ED1V01Y201005AIM009, 4:1, (1-103), Online publication date: 1-Jan-2010.
Contributors
  • Williams College
  • New York University
  • Brown University

Recommendations

Acceptance Rates

Overall Acceptance Rate140of548submissions,26%
YearSubmittedAcceptedRate
ICML '0654814026%
Overall54814026%