Obsidian Extract PDF Annotations Plugin

Overview

Obsidian Extract PDF Annotations Plugin

This is a plugin for Obsidian. It extracts Annotations from PDF Files.

Usage

This Plugin visits all PDF files in a given directory and extracts comments and highlights from the PDF files. It treats the first line of every comment as Topic for grouping the comments.

Assume we have in a folder in our Vault containing PDF files, e.g:

vault_folder

and we have highlighted the Julia Hello World Programm with a note 'Hello World':

pdf_note

In the editor (e.g. _Extract) we run the plugin's command Extract PDF Annotations (Hotkey Ctrl-P for all Commands). This will fetch all annotations in the PDF files in the current folder and sort them by Topic:

extracted_annotations

As such, you can relate comments for your topics (here 'Hello World') from several PDF files.

Commands

  • Extract PDF Annotations Works when editing a markdown note. Searches all PDF files in current Folder for annotations, and inserts them at the current position of the open note.
  • Extract PDF Annotations on single file Works while displaying a PDF file. Extracts annotations from this file and writes them to the note Annotations for

Plugin Settings:

  • Use the first line of the comment as 'Topic' (and sort accordingly), or not
  • Use folder name or PDF-Filename for sorting

Versions

1.0.4 clean up hyphenation #5 1.0.3 updated highlight fetching to use QuadPoints instead of Rectangles

Installation / Build

Fetch repository:

$ git clone https://github.com/munach/obsidian-extract-pdf-annotations.git
$ cd obsidian-extract-pdf-annotations

Install dependencies:

$ npm i

Transpile main.ts:

$ npm run build

Then create the plugin directory and copy the files main.js and manifest.json, e.g.;

$ mkdir ~/MyVault/.obsidian/plugins/obsidian-extract-pdf-annotations
$ cp main.js manifest.json ~/MyVault/.obsidian/plugins/obsidian-extract-pdf-annotations/

Enable the plugin in Obsidan's setting.

Issues / Bugs

[] works only on left-to-right highlights

Credits

This plugin builds on ideas from Alexis Rondeaus Plugin https://github.com/akaalias/obsidian-extract-pdf-highlights, but uses obsidians build-in pdf.js library.

Author

Franz Achermann

You might also like...

Obsidian Web: Connect your browser with your Obsidian notes

 Obsidian Web: Connect your browser with your Obsidian notes

Obsidian Web: Connect your browser with your Obsidian notes This is an unofficial Chrome extension for Obsidian that lets you send content from the we

Jan 3, 2023

This project is used to extract media from various posting platfroms like Twitter, Reddit, Pixiv, Youtube and many other

Social-Picker-API This project is used to extract media from various posting platfroms like Twitter, Reddit, Pixiv, Youtube and many others. It's writ

Nov 29, 2022

The frontend of a full stack application of a personal wallet made with React, Node and MongoDB that allows you to add inputs, outputs and see all your extract.

The frontend of a full stack application of a personal wallet made with React, Node and MongoDB that allows you to add inputs, outputs and see all your extract.

The frontend of a full stack application of a personal wallet made with React, Node and MongoDB that allows you to add inputs, outputs and see all your extract.

Jun 2, 2022

The backend of a full stack application of a personal wallet made with React, Node and MongoDB that allows you to add inputs, outputs and see all your extract.

The backend of a full stack application of a personal wallet made with React, Node and MongoDB that allows you to add inputs, outputs and see all your extract.

My first full stack application with the concept of a personal wallet that allows you to create a personal account to keep track of your entire statement by adding incoming and outgoing transactions, as well as calculating the total balance and being able to edit and delete old transactions.

Jun 23, 2022

Converts your IPv4 address to a 4x4 2-bit PNG which you can extract the IP from.

Converts your IPv4 address to a 4x4 2-bit PNG which you can extract the IP from.

IP-to-PNG Converts your IPv4 address to a 4x4 2-bit PNG which you can extract the IP from. https://www.npmjs.com/package/ip2png Run npm install ip2png

Nov 30, 2022

Extract data-like things from a website on the fly.

There-should-be-an-API Oh, I think this website needs an API. Extract data-like things from a website on the fly. Demo The demo API is hosted on a 256

Mar 26, 2022

The ICPverse service to extract the metadata from an existing NFT project on the Internet Computer.

Infinity_Rank The ICPverse service to extract the metadata from an existing NFT project on the Internet Computer. Requirements for Use: npm installed

Nov 5, 2022

Compare 👥 and extract 🔬 differences between JSON values

json-changes Easily compare changes in JSON-like values. This can be especially useful when you want to compare an incoming payload with a counterpart

Aug 23, 2022

Tiny and fast utility to extract all possible values for a given enum.

Tiny (208B) and fast utility to extract all possible values for a given enum.

Apr 18, 2022
Comments
  • hyphenation in extracted highlights isn't cleaned up

    hyphenation in extracted highlights isn't cleaned up

    In some cases, the extracted text is literally correct, but in a way that makes it hard to read. I believe this plug-in is stripping newlines, which is great! I wish that it would also notice when a word has been hyphenated, and in that case, strip the hyphen and the newline and join the word back together.

    In my PDF: as a result such policies and prac- tices have pushed

    What currently gets extracted: " as a result such policies and prac- tices have pushed"

    What I wish were extracted instead: (and this is in fact what gets copied when selecting the multi-line source in Obsidian's PDF viewer and pasting elsewhere) " as a result such policies and practices have pushed"

    enhancement 
    opened by owldown 1
  • Extracted highlights sometimes omit final character(s).

    Extracted highlights sometimes omit final character(s).

    Playing with this for the first time today, and it seems pretty nifty. I've noticed that on some of the PDFs in my library, the extracted highlights omit one or two of the last characters.

    For example, in the PDF, I highlighted "experienced by their ancestors." but what was extracted was "experienced by their ancestor" (missing the s and period).

    opened by owldown 3
Releases(1.0.4)
Owner
null
I'm trying to create simple program for adding the digital signature to a pdf file with self-signed certificate. I use node-signpdf and pdf-lib library.

pdf-digital-signature-with-node-signpdf-ejs I'm trying to create simple program for adding the digital signature to a pdf file with self-signed certif

null 5 Dec 25, 2022
A plugin for Obsidian (https://obsidian.md) that adds a button to its search view for copying the Obsidian search URL.

Copy Search URL This plugin adds a button to Obsidian's search view. Clicking it will copy the Obsidian URL for the current search to the clipboard. T

Carlo Zottmann 6 Dec 26, 2022
Plugin to enable PDF files as first class citizens within an Obsidian vault.

Obsidian-PDF Plugin to enable PDF files as first class citizens within an Obsidian vault. What does this plugin do for me? All PDF files in your vault

Noah Cabral 16 Nov 26, 2022
An Obsidian plugin to paste Excel tables as Markdown tables in Obsidian editor.

Obsidian Excel to Markdown Table An Obsidian plugin to paste Excel tables as Markdown tables in Obsidian editor. Demo You can paste the copied Excel d

Ganessh Kumar 108 Jan 4, 2023
This is a plugin for Obsidian (https://obsidian.md). Can open document with `.html` and `.htm` file extensions.

Obsidian HTML reader Plugin This is a plugin for Obsidian (https://obsidian.md). Can open document with .html and .htm file extensions. Obsidian HTML

null 37 Dec 27, 2022
An Obsidian plugin that lets you browse the web within Obsidian.

Obsidian Web Browser An Obsidian plugin that allows you to browse the web within Obsidian using v1.0 tabs. The core functionality of the plugin, rende

Dion Tryban 102 Dec 28, 2022
A logseq plugin to extract tweets. Optionally tag the tweet's author as well.

Logseq Twitter Extractor Plugin If this plugin helps you, I'd really appreciate your support. You can buy me a coffee here. A quick utility to parse a

null 17 Nov 17, 2022
A little JavaScript plugin to generate PDF, XLS, CSV and DOC from JavaScript Object or DOM element only from the frontend!

?? JavaScript Object to csv, xls, pdf, doc and DOM to html generator ?? A little JavaScript plugin to generate PDF, XLS, CSV and DOC from JavaScript O

null 61 Jan 7, 2023
Obsidian-dataview-table-filter-menu - Dynamically created filter menu for dataview tables in obsidian

Dataview table Filter Menu for Obsidian Dynamically created filter menu for data

shiro 17 Sep 24, 2022
Obsidian-Snippet-collection - A collection of snippet to customize obsidian

This repo is a collection of CSS snippets for Obsidian.md. To install them on PC

Mara 110 Dec 22, 2022