Faaiz Taufiq
Nov 23 • 7 tweets • 4 min read
Paper alert: 📜
We will be presenting our paper "Conformal Off-Policy Prediction" at #NeurIPS2022.
arxiv.org/abs/2206.04405
We present COPP, a novel methodology for quantifying uncertainty in off-policy outcomes. 1/n
Given an untested policy and past observational data, how do you find the most likely outcome(s) under this policy without deploying it in the real world? We solve this problem for contextual bandits using Conformal Prediction, which comes with strong theoretical guarantees. 2/n
Existing off-policy evaluation (OPE) methodologies estimate the *average* reward under the new policy. This average does not convey information about the distribution of the reward itself. To the best of our knowledge, COPP is the first work that estimates the uncertainty in the reward itself. 3/n
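To make the contrast concrete, here is a minimal sketch of one standard average-reward estimator, inverse propensity scoring (IPS). It is not taken from the paper; the function name `ips_value` and the logged data are hypothetical and chosen only for illustration.

```python
import numpy as np

def ips_value(rewards, target_probs, behavior_probs):
    """IPS estimate of the mean reward under the target policy.

    target_probs[i]   = probability the new policy picks the logged action a_i given x_i
    behavior_probs[i] = probability the logging policy picked a_i given x_i
    """
    rewards = np.asarray(rewards, dtype=float)
    weights = np.asarray(target_probs, dtype=float) / np.asarray(behavior_probs, dtype=float)
    return float(np.mean(weights * rewards))

# Hypothetical logged data: every observed reward is either 0 or 10.
rewards = np.array([0.0, 10.0, 0.0, 10.0])
behavior_probs = np.array([0.5, 0.5, 0.5, 0.5])
target_probs = np.array([0.5, 0.5, 0.5, 0.5])

value = ips_value(rewards, target_probs, behavior_probs)
print(value)
```

The estimate is a single number that lies between the two reward values actually observed, so it says nothing about how spread out the outcomes are. That is the gap a predictive interval or set is meant to fill.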
Additionally, results from COPP are context dependent, i.e., COPP provides the most likely outcome(s) for a given context X, if we were to choose actions according to the new policy. In doing so, COPP provides granular information about the policy performance. 4/n
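As a rough illustration of how conformal prediction can account for a change of policy, the sketch below implements weighted split conformal prediction with policy-ratio weights. It is a simplified stand-in and not the COPP algorithm: it assumes discrete actions, known policy probabilities, and a pre-fitted reward model, and it conditions on a sampled action, whereas the paper targets the reward given the context alone and its weights may be constructed differently. All names here (`weighted_conformal_quantile`, `reward_interval`, `predict`) are hypothetical.

```python
import numpy as np

def weighted_conformal_quantile(scores, weights, test_weight, alpha):
    """(1 - alpha) quantile of the weighted calibration scores.

    The test point contributes a point mass at +infinity, as in
    weighted conformal prediction under covariate shift.
    """
    scores = np.asarray(scores, dtype=float)
    weights = np.asarray(weights, dtype=float)
    order = np.argsort(scores)
    sorted_scores = scores[order]
    total = weights.sum() + test_weight
    cumulative = np.cumsum(weights[order]) / total
    # First index whose cumulative normalised weight reaches 1 - alpha.
    idx = int(np.searchsorted(cumulative, 1.0 - alpha, side="left"))
    if idx >= len(sorted_scores):
        return float("inf")  # not enough calibration mass: interval is unbounded
    return float(sorted_scores[idx])

def reward_interval(predict, x, a, cal_scores, cal_weights, test_weight, alpha=0.1):
    """Interval for the reward at context x and action a.

    cal_scores[i]  = |r_i - predict(x_i, a_i)| on held-out logged data
    cal_weights[i] = pi_new(a_i | x_i) / pi_behavior(a_i | x_i)   (assumed known)
    test_weight    = pi_new(a | x) / pi_behavior(a | x)
    """
    q = weighted_conformal_quantile(cal_scores, cal_weights, test_weight, alpha)
    center = predict(x, a)
    return center - q, center + q

# Hypothetical usage with a toy reward model and four calibration points.
predict = lambda x, a: 2.0 * x + a
cal_scores = np.array([1.0, 2.0, 3.0, 4.0])
cal_weights = np.array([1.0, 1.0, 1.0, 5.0])
low, high = reward_interval(predict, x=1.0, a=1, cal_scores=cal_scores,
                            cal_weights=cal_weights, test_weight=1.0, alpha=0.5)
print(low, high)
```

Because the quantile is recomputed from the test point's own weight and the interval is centred on the model's prediction at that context, the output changes with the context rather than being one global number.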
COPP is a first step towards off-policy assessment using the uncertainty of the reward itself, and could lead to interesting possibilities like robust policy optimisation, by optimising the worst-case outcomes. 📈 5/n
I would like to thank my co-authors @jeanfrancois287 (equal contribution), Rob Cornish, @yeewhye, @ArnaudDoucet1, without whom this work would not have been possible. If you're interested in more details, come say hi to us at NeurIPS. 6/n
If you're interested in more details, come say hi to us at NeurIPS 🚀. Also, check out a high-level summary of our work at .
n=7
