Research overview |
My research goal is to develop scalable, interoperable, parallel algorithms
available as libraries and programs to analyze data in a broad sense.
I currently work
on adapting and optimizing a broad spectrum of algorithms in machine learning and graphs.
I conduct research on both
multi-node processing with distributed memory/storage,
as well as multi-threaded processing on multicore CPUs and GPUs.
I work on many science and engineering problems,
but most of my applied research is in medicine and healthcare.
In my lab develop programs in C++, SQL and Python, mostly in Unix.
|
Research topics (overview) |
- Extending and optimizing data science languages, like Python and R, to analyze big data
- Scalable and Parallel algorithms for Big Data analytics: mainly machine learning and graphs.
- Eliminating RAM, interoperability and speed limitations from
data science programming languages (Python, R).
`
- Big data problems: classifying documents,
information retrieval on bibliography records,
keyword search, large-scale matrix multiplication,
ontology construction, linked data and semantic web.
- Data science applications in medicine.
- Database modeling:
extending ER database models to manage data pre-processing,
managing analytic workflows, solving data quality issues, querying source code.
- Query processing: recursive queries, joins on graphs, cubes, skylines, pivoting,
workload optimization, data partitioning.
|
Articles available for download |
Click on the top menu to download
author-prepared, unofficial, versions of published articles:
journal articles,
conference/workshop proceedings, presentation slides.
These PDF files have 98% the same content as the official version, but with different format.
Journal articles present the most important research results in polished form,
whereas proceedings articles present recent and preliminary research.
|
Articles in digital libraries, grant support |
-
DBLP (90% complete;
big subset of ACM; 2 months behind ACM)
-
Google Scholar (citations; 95% complete)
|
International Academic Service
|
Journals:
- Associate Editor IEEE Transactions on Knowledge and Data Engineering (TKDE) 2017-2021.
- Associate Editor Data & Knowledge Engineering (DKE) 2019-2023.
- Associate Editor Intelligent Data Analysis (IDA) 2010-2013.
Conferences:
- Program Chair: DOLAP 2010, 2015.
- Program Chair: SADASC 2020.
- Program Chair: DaWaK 2018, DaWaK 2019.
- Program Chair: MEDI 2018.
- Program ViceChair: Big Data 2020.
- PC member: IEEE Big Data 2016, 2020,2021,2022.
- PC member: DaWaK, 2020,2021,2022.
- PC member: DOLAP 2008-2021.
- PC member: DEXA 2018-2020.
- PC member: BDA 2020.
- PC member: ADBIS 2020.
- PC member: SIGMOD 2016, SIGMOD 2017.
- PC member: AMW 2015, AMW 2016, AMW 2019, AMW 2020.
- PC member: KDD 2014, KDD 2015.
|
Colleagues and Collaborators |
- Divesh Srivastava
ATT Labs, USA
- Mike Stonebraker
MIT, USA
- Ladjel Bellatreche
ENSMA, France
- Il-Yeol Song
Drexel University, USA
- Joe Hellerstein
University of California at Berkeley, USA
- Ophir Frieder
Georgetown University, USA
- Chris Jermaine
Rice University, USA
- Hanna Oktaba
UNAM, Mexico
- Hamid Pirahesh
IBM, USA
- Sofian Maabout
LaBRI, France
- Oscar Romero
UPC, Espana
- Ahmad Ghazal
Facebook
- Javier Garcia-Garcia
UNAM, Mexico
- Luis B. Morales
UNAM, Mexico
- Esteban Zimanyi
Universite Libre de Bruxelles, Belgium
- Nicolas Lachiche
University of Strasbourg, France
- Hiroshi Oyama
The University of Tokyo, Japan
|