The Quest for Private Database Access

Tuesday - November 28 2023, 04:22 UTC - 1 year ago

tldr #

Researchers have finally achieved private information retrieval, a long sought-after concept in cryptography, and extended it to create a general privacy strategy. The paper, which was presented with a Best Paper Award in June of 2020, provides evidence of a way to build privacy preserving applications without any interactions between the user and the server.

content #

We all know to be careful about the details we share online, but the information we seek can also be revealing. Search for driving directions, and our location becomes far easier to guess. Check for a password in a trove of compromised data, and we risk leaking it ourselves.

These situations fuel a key question in cryptography: How can you pull information from a public database without revealing anything about what you’ve accessed? It’s the equivalent of checking out a book from the library without the librarian knowing which one.

The problem is related to cryptography which means the use of encryption to secure data

Concocting a strategy that solves this problem — known as private information retrieval — is "a very useful building block in a number of privacy-preserving applications," said David Wu, a cryptographer at the University of Texas, Austin. Since the 1990s, researchers have chipped away at the question, improving strategies for privately accessing databases. One major goal, still impossible with large databases, is the equivalent of a private Google search, where you can sift through a heap of data anonymously without doing any heavy computational lifting.

The preprocessing concept was first proposed in the early 2000s

Now, three researchers have crafted a long-sought version of private information retrieval and extended it to build a more general privacy strategy. The work, which received a Best Paper Award in June at the annual Symposium on Theory of Computing, topples a major theoretical barrier on the way to a truly private search.

"[This is] something in cryptography that I guess we all wanted but didn’t quite believe that it exists," said Vinod Vaikuntanathan, a cryptographer at the Massachusetts Institute of Technology who was not involved in the paper. "It is a landmark result." .

The paper suggests a proof that a single server approach can work for certain types of queries

The problem of private database access took shape in the 1990s. At first, researchers assumed that the only solution was to scan the entire database during every search, which would be like having a librarian scour every shelf before returning with your book. After all, if the search skipped any section, the librarian would know that your book is not in that part of the library.

That approach works well enough at smaller scales, but as the database grows, the time required to scan it grows at least proportionally. As you read from bigger databases — and the internet is a pretty big one — the process becomes prohibitively inefficient.

The research introduces a new approach to privacy preserving strategies

In the early 2000s, researchers started to suspect they could dodge the full-scan barrier by "preprocessing" the database. Roughly, this would mean encoding the whole database as a special structure, so the server could answer a query by reading just a small portion of that structure. Careful enough preprocessing could, in theory, mean that a single server hosting information only goes through the process once, by itself, allowing all future users to grab information privately without any more effort.

The paper was presented at the annual Symposium on Theory of Computing with a Best Paper Award in June 2020

For Daniel Wichs, a cryptographer at Northeastern University and a co-author of the new paper, that seemed too good to be true. Around 2011, he started trying to prove that this kind of scheme was impossible. "I was convinced that there’s no way that this could be done," he said.

But in 2017, two groups of researchers published riddles that, if solved, would prove that Wichs was wrong. In the paper, Wichs and his colleagues answered those riddles, showing that queries on preprocessed databases can be made private without any need for special interactions between the user and the server.

The paper was authored by three researchers from Northeastern University

hashtags #

privacypreservingapplications cryptography privatedatabaseaccess privateinformationretrieval

worddensity #

information (6, 1.08%)
database (6, 1.08%)
private (6, 1.08%)
search (5, 0.9%)
without (5, 0.9%)