The formats of the birthday column aren’t consistent. AFAIK, they are YYYY-MM-DD, e.g. 1745-04-02, before the 20th century and mm/dd/YYYY, e.g. 2/1/1900, in the 20th century. As a result, I get the following unfriendly error. (It’d be nice if ETL stuff was covered before being thrown this curveball.)

Traceback (most recent call last):
  File "c:\PATHNAME\rpwwpypl4.py", line 13, in <module>
    print(gov.select("last_name", pl.col("birthday").cast(pl.Date), "type", "state"))
          ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "c:\PATHNAME\.venv\Lib\site-packages\polars\dataframe\frame.py", line 9856, in select   
    .collect(optimizations=QueryOptFlags._eager())
     ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "c:\PATHNAME\.venv\Lib\site-packages\polars\_utils\deprecation.py", line 97, in wrapper 
    return function(*args, **kwargs)
           ^^^^^^^^^^^^^^^^^^^^^^^^^
  File "c:\PATHNAME\.venv\Lib\site-packages\polars\lazyframe\opt_flags.py", line 330, in wrapper
    return function(*args, **kwargs)
           ^^^^^^^^^^^^^^^^^^^^^^^^^
  File "c:\PAHTNAME\.venv\Lib\site-packages\polars\lazyframe\frame.py", line 2335, in collect  
    return wrap_df(ldf.collect(engine, callback))
                   ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
polars.exceptions.InvalidOperationError: conversion from `str` to `date` failed in column 'birthday' for 1 out of 64 values: ["12/7/1906"]  

You might want to try:
- setting `strict=False` to set values that cannot be converted to `null`
- using `str.strptime`, `str.to_date`, or `str.to_datetime` and providing a format string

Bartosz Zaczyński RP Team on Aug. 11, 2025

@toigopaul Where do you pull your data from? The attached CSV file uses the YYYY-MM-DD format consistently, e.g.:

Mobley,William,Carlton,,,,1906-12-07,M,rep,GA,6,,Democrat,(...)

toigopaul on Aug. 11, 2025

@Bartosz Zaczyński I got the data from Supporting Material->Sample Code (.zip)

Mobley,William,Carlton,,,,12/7/1906,M,rep,GA,6,,Democrat,,,,,,,,,,M000835,,,,,,407809,,,,6578,Carlton Mobley

Christopher Trudeau RP Team on Aug. 11, 2025

Hi @toigopaul,

I just double checked the original data and that in the Supported Materials ZIP. I’m getting the same result as @Bartosz. Any chance you opened the CSV file with something else first? Or maybe overwrote it? I’d suggest grabbing it again as something has been corrupted along the way.

toigopaul on Aug. 11, 2025

@Christopher Trudeau Mea culpa. Indeed, as I only wanted to extract the csv and not the code, I opened the csv from within the zip using Excel and saved the csv. It should have occured to me that this could cause a transformation. When I extract everything and then inspect the csv, all dates are YYYY-MM-DD.

Christopher Trudeau RP Team on Aug. 11, 2025

Hi @toigopaul,

We seem to be posting past each other in the two different spaces. Short version: Excel is only seeing some of those dates as dates, hence why you ended up with the mix. Thanks for catching the other problem, we’ll get on it.

Become a Member to join the conversation.