Sitemap
A list of all the posts and pages found on the site. For you robots out there is an XML version available for digesting as well.
Pages
Posts
projects
Leveraging IA3 adapters for parameter-efficient logical deduction, Lightning AI
Investigates IA3 adapters for parameter efficient fine-tuning to enhance Llama-3’s logical deduction capabilities. (Apr. 2024)
Optimizing Reinforcement Policies for Aligning LLMs, Weights & Biases
Motivates the formulation of the proximal policy optimization algorithm and applies it for reinforcement learning from human feedback (RLHF) to align Google’s Gemma with human conversational preferences. (Mar. 2024)
Exploring Diffusion Models with JAX
Explores the mathematics and code of discrete diffusion models, assessing their effectiveness, applicability, and limitations. (Jun 2022)
Review Classification using Active Learning, Keras.io
Demonstrates the effectiveness of active learning in text classification tasks by implementing a ratio- based sampling approach, improving performance and suitability for industrial applications. (Oct 2021)
The Reality Behind the Optimization of Imaginary Variables - I, Weights & Biases
Examines how complex-valued neural networks enhance representations by experimenting with traditional mappings and more efficient complex vector techniques. (Jun 2021)
The Reality Behind the Optimization of Imaginary Variables - II, Weights & Biases
Explores circularity and holomorphicity constraints in complex-valued neural networks, showcasing the effectiveness of linear and widely linear networks for image-denoising. (Oct 2021)
MelGAN based Spectrogram Inversion using Feature Matching, Keras.io
Showcases a non-autoregressive model for spectral inversion utilizing a feature-matching discriminator, highlighting fast inference on dynamic inputs. (Sep 2021)
publications
Robust and explainable identification of logical fallacies in natural language arguments
Published in Knowledge Based Systems, 2023
This paper formalizes prior theoretical work on logical fallacies into a comprehensive three-stage evaluation framework of detection, coarse- grained, and fine-grained classification, and employs three families of robust and explainable methods based on prototype reasoning, instance-based reasoning, and knowledge injection.
Recommended citation: Sourati, Z., Venkatesh, V. P. P., Deshpande, D., Rawlani, H., Ilievski, F., Sandlin, H., & Mermoud, A. (2023). Robust and explainable identification of logical fallacies in natural language arguments. Knowledge-Based Systems, 266, 110418. https://doi.org/10.1016/j.knosys.2023.110418 https://www.sciencedirect.com/science/article/pii/S0950705123001685
Robust Text Classification: Analyzing Prototype-Based Networks
Published in arXiv, 2023
A modular and comprehensive framework for studying Prototype Based Networks (PBNs), which includes different backbone architectures, backbone sizes, and objective functions is designed, which shows that the robustness of PBNs transfers to NLP classification tasks facing realistic perturbations.
Recommended citation: Sourati, Z., Deshpande, D., Ilievski, F., Gashteovski, K., & Saralajew, S. (2023). Robust Text Classification: Analyzing Prototype-Based Networks. ArXiv, abs/2311.06647. https://arxiv.org/abs/2311.06647
Contextualizing Argument Quality Assessment with Relevant Knowledge
Published in NAACL-2024, 2024
This work proposes SPARK: a novel method for scoring argument quality based on contextualization via relevant knowledge, and devise four augmentations that leverage large language models to provide feedback, infer hidden assumptions, supply a similar-quality argument, or a counterargument.
Recommended citation: Darshan Deshpande, Zhivar Sourati, Filip Ilievski, and Fred Morstatter. 2024. Contextualizing Argument Quality Assessment with Relevant Knowledge. In Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 2: Short Papers), pages 316–326, Mexico City, Mexico. Association for Computational Linguistics. https://aclanthology.org/2024.naacl-short.28/
GNOME: Generating Negotiations through Open-Domain Mapping of Exchanges
Published in arXiv, 2024
Introduces GNOME, an automated framework that uses Large Language Models to generate synthetic open-domain negotiation dialogues from closed-domain datasets, addressing the limited generalizability of existing negotiation models. Experiments show that models trained on GNOME-generated data outperform state-of-the-art models in both domain-specific strategy prediction and generalization to novel domains, while reducing manual data curation efforts.
Recommended citation: Deshpande, D., Sinha, S., Kumar, A., Pal, D. & May, J. (2024). GNOME: Generating Negotiations through Open-Domain Mapping of Exchanges. ArXiv, abs/2406.10764. https://arxiv.org/abs/2305.12280
talks
Diffusing Away from GANs and Transformers
Published:
Discussing the hype surrounding diffusion models and exploring their effectiveness, applicability, and drawbacks. [slides]