Skip to main content

Querying databases from the AP

The Data Engineering Team have developed some simple packages to allow you to query our databases using R and Python on the Analytical Platform. pydbtools is a python module, while dbtools is an R package that uses pydbtools under the hood. You can use these packages to execute SQL queries on Athena databases, bringing the results into your environment as a DataFrame to do further analysis. While these are the only “officially” supported packages, there is also the community built Rdbtools which has additional functionality. It can be used with the understanding that if the package requires fixing or updating, it is the responsibility of those using the package to do so.

R - dbtools

The README in the dbtools repository gives details on how to install and use the package with R. This package is maintained by the data engineering team.

R - Rdbtools

The README provides details. This package is maintained by the analytical platform user community.

Python - pydbtools

The README in the pydbtools repository gives details on how to install and use the package with Python. This package is maintained by the data engineering team.

This page was last reviewed on 5 May 2021. It needs to be reviewed again on 5 May 2022 by the page owner #ask-data-engineering .
This page was set to be reviewed before 5 May 2022 by the page owner #ask-data-engineering. This might mean the content is out of date.