{ "cells": [ { "cell_type": "markdown", "metadata": {}, "source": [ "# String\n", "\n", "[[Polars Documentation](https://docs.pola.rs/api/python/stable/reference/expressions/string.html)]\n", "\n", "In the string module, OpenDP currently only supports parsing to temporal data types." ] }, { "cell_type": "code", "execution_count": 1, "metadata": {}, "outputs": [], "source": [ "import polars as pl\n", "import opendp.prelude as dp\n", "dp.enable_features(\"contrib\")\n", "\n", "context = dp.Context.compositor(\n", " # Many columns contain mixtures of strings and numbers and cannot be parsed as floats,\n", " # so we'll set `ignore_errors` to true to avoid conversion errors.\n", " data=pl.scan_csv(dp.examples.get_france_lfs_path(), ignore_errors=True),\n", " privacy_unit=dp.unit_of(contributions=36),\n", " privacy_loss=dp.loss_of(epsilon=1.0, delta=1e-7),\n", " split_evenly_over=2,\n", ")" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "## Strptime, To Date, To Datetime, To Time\n", "\n", "Dates can be parsed from strings via `.str.strptime`, and its variants `.str.to_date`, `.str.to_datetime`, and `.str.to_time`." ] }, { "cell_type": "code", "execution_count": 2, "metadata": {}, "outputs": [ { "data": { "text/html": [ "
| YEAR | len | 
|---|---|
| date | u32 | 
| 2005-01-01 | 342193 | 
| 2006-01-01 | 339683 | 
| 2007-01-01 | 350429 | 
| 2008-01-01 | 348574 | 
| 2009-01-01 | 416966 | 
| 2010-01-01 | 500385 | 
| 2011-01-01 | 517166 | 
| 2012-01-01 | 515460 | 
| 2013-01-01 | 480615 |