Publications
2025
- NeoBERT: A Next-Generation BERT. arXiv preprint arXiv:2502.19587, 2025
- Structure-Aligned Protein Language Model. arXiv preprint arXiv:2505.16896, 2025
- Small Encoders Can Rival Large Decoders in Detecting Groundedness. arXiv preprint arXiv:2506.21288, 2025
- Combining Domain and Alignment Vectors Provides Better Knowledge-Safety Trade-offs in LLMs. In Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), 2025
- CADmium: Fine-Tuning Code Language Models for Text-Driven Sequential CAD Design. arXiv preprint arXiv:2507.09792, 2025
2024
- Predicting the Impact of Model Expansion through the Minima Manifold: A Loss Landscape Perspective. 2024
- A deep dive into the trade-offs of parameter-efficient preference alignment techniques. arXiv preprint arXiv:2406.04879, 2024
- Exploring quantization for efficient pre-training of transformer language models. arXiv preprint arXiv:2407.11722, 2024
- Combining domain and alignment vectors to achieve better knowledge-safety trade-offs in llms. arXiv preprint arXiv:2411.06824, 2024
- Protein language models: Is scaling necessary? bioRxiv, 2024
- Interpolate: How Resetting Active Neurons can also improve Generalizability in Online Learning. 2024
2023
- A practical survey on faster and lighter transformers. ACM Computing Surveys, 2023
- Detection of microservice-based software anomalies based on OpenTracing in cloud. Software: Practice and Experience, 2023
- Distributed computation of the critical path from execution traces. Software: Practice and Experience, 2023
- Language models for novelty detection in system call traces. arXiv preprint arXiv:2309.02206, 2023
- Energy and carbon-aware initial VM placement in geographically distributed cloud data centers. Sustainable Computing: Informatics and Systems, 2023
2022
- Machine Learning for Anomaly Detection in Kernel Traces. 2022
2021
- On improving deep learning trace analysis with system call arguments. In 2021 IEEE/ACM 18th International Conference on Mining Software Repositories (MSR), 2021
- Automated cause analysis of latency outliers using system-level dependency graphs. In 2021 IEEE 21st International Conference on Software Quality, Reliability and Security (QRS), 2021
2020
- Depgraph: Localizing performance bottlenecks in multi-core applications using waiting dependency graphs and software tracing. In 2020 IEEE 20th International Working Conference on Source Code Analysis and Manipulation (SCAM), 2020
2019
- Empirical comparison between autoencoders and traditional dimensionality reduction methods. In 2019 IEEE Second International Conference on Artificial Intelligence and Knowledge Engineering (AIKE), 2019
- Automatic cause detection of performance problems in web applications. In 2019 IEEE International Symposium on Software Reliability Engineering Workshops (ISSREW), 2019