References
Interesting Projects
- Biomedical Harvesters
- BiomedicalTextMining
- GROBID - Generation of Bibliographic Data
- Indra Labs
- FrameNet
- Global FrameNet - hub for collaboration in the development of frame-based language resources and applications
- FrameNet Brasil Computational Linguistics Lab
- Global FrameNet [reddit]
- Open Research Knowledge Graph (ORKG) aims to describe research papers in a structured manner
- Enterprise Knowledge Graph Foundation (EKGF)
- DebateGraph
Software, Tools, Libraries
- spaCy: Industrial-Strength Natural Language Processing
- Argdown: A simple syntax for complex argumentation
- LangChain: framework for developing applications powered by language models; also available in JavaScript
- Interactive Composition Explorer: trace visualizer for language model programs
- PyG: ultimate library for Graph Neural Networks
- Open Graph protocol enables any web page to become a rich object in a social graph
- GraphQL: developer-friendly query language for the modern web
- AsyncAPI for event-driven architectures
- Ceramic Protocol: smart document protocol for an open dataweb
- Open-SESAME: frame-semantic parser for automatically detecting FrameNet frames and their frame-elements from sentences
Articles, Tutorials, Demos
- Emerging Architectures for LLM Applications [a16z.com] 20230610
- Llama, Alpaca and Vicuna: “Chatgpt” running on your laptop [medium] 20230415
- Building a Semantic Search Engine With OpenAI and Pinecone - github 20230323
- FrameNet Parsing with Transformers [github.io] 20220524
- Language Models are Open Knowledge Graphs .. but are hard to mine! [medium] 20210117
- Frame Semantic Transformer [github.io]
- NLTK Sample Usage for FrameNet [nltk.org]
- Python Tutorials for NLP, ML, AI [github]
Dataset Sources
- NIEMOpen: community-driven common vocabulary; enables efficient information exchange
- Human Cell Atlas
- Food Nutrition Data
- EMMAA - primarily deals at the molecular level (pathways, bindings, etc) of disease processes
- KaBOB: Ontology-Based Semantic Integration of Biomedical Databases
- UC Denver biomedical data sources
- Universal Wordnet
- VerbNet
- YAGO: A High-Quality Knowledge Base
- YAGO 3 [github] combines the information from the Wikipedias in multiple languages with WordNet, GeoNames, and other data sources.
- YAGO-SUMO: A Large-Scale Formal Ontology
- YAGO 4: A Reason-able Knowledge Base - Academic Paper on YAGO 4
- BioPortal - National Center for Biomedical Ontology [github]
ML Models
- LLaMa for CPU
- AI2 OLMo
- OpenLLaMa - Hugging Face Model
- https://github.com/lm-sys/FastChat
- LLaMa Chat allows you to chat with LLaMa, Alpaca and GPT4All models1 running locally
- Vicuna
- BLOOM [huggingface]: BigScience Large Open-science Open-access Multilingual Language Model
Publications
- Duplicate question identification by integrating framenet with neural networks [acm.org]
- Semantic Search for Biomedical Texts using Predicate-Argument Structure
- Open-Domain Frame Semantic Parsing Using Transformers [arXiv]
- Frame-Semantic Parsing with Softmax-Margin Segmental RNNs and a Syntactic Scaffold [arXiv]
- Open Research Knowledge Graph:A System Walkthrough [arXiv]