RapidAPI

RapidAPI

I’m very pleased to announce that many of the plot2txt API methods are now available via RapidAPI. You can access the example figure search engine described in previous blog posts, as well as the endpoint used to process document pages and produce said search engine. In the same marketplace are various other methods, including theContinue Reading RapidAPI

New api method for data image search

New api method for data image search

As mentioned, after doing some experiments for the KDnuggets article, I bundled some of the existing API methods into a new one, which will extract from a page figures that have x/y scaling information. The JSON output is well suited to elasticsearch or your favorite flavor of NoSQL eg., an extract of the response: [{“input”:”tmp/quant-ph0002044-9.png”,Continue Reading New api method for data image search

Arxiv mining

Arxiv mining

Some time ago I launched a little project, mining data from arxiv; you can read about it in other blog posts. Specifically, I modeled figures from about 500k figures as Gaussian mixture models, in order to create some features, so figures might be ultimately represented as graphs for comparison. More ordinary methods might suffice tooContinue Reading Arxiv mining

API Gateway Perf

API Gateway Perf

At the time of writing, AWS API gateway doesn’t support gzip requests, so I’ve been handling this at the lambda function itself and client side. Obviously compression makes a dramatic difference w.r.t performance, just ask the guys at Pied Piper 🙂 Another curious absence is support for multipart form data; attached a screen grab fromContinue Reading API Gateway Perf