Nick Radcliffe - Test-Driven Data Analysis in Python | PyData London 2024

Ғылым және технология

PyData
Website: www.pydata.org
LinkedIn: / pydata-global
Twitter: / pydata
Test-driven data analysis is a methodology and open-source Python library for improving quality in data processes. It covers three main areas:
• Testing data (generating constraints and using them to validate new data)
PyData is an educational program of NumFOCUS, a 501(c)3 non-profit organization in the United States. PyData provides a forum for the international community of users and developers of data analysis tools to share ideas and learn from each other. The global PyData network promotes discussion of best practices, new approaches, and emerging technologies for data management, processing, analytics, and visualization. PyData communities approach data science using many languages, including (but not limited to) Python, Julia, and R.
PyData conferences aim to be accessible and community-driven, with novice to advanced level presentations. PyData tutorials and talks bring attendees the latest project features along with cutting-edge use cases.
00:00 Welcome!
00:10 Help us add time stamps or captions to this video! See the description for details.
Want to help add timestamps to our KZread videos to help with discoverability? Find out more here: github.com/numfocus/KZreadVi...

Пікірлер: 2

  • @herewegoagain2
    @herewegoagain28 күн бұрын

    Aren't the constraints restrictive in the sense they're univariate? They're definitely helpful but not exhaustive

  • @herewegoagain2

    @herewegoagain2

    8 күн бұрын

    Most practical model 'failures' are due to relationship breakdowns even if they stay within individual constraints. I understand this library isn't meant to be a drift detection library but I think the current setup would work great with that use case

Келесі