Use AI to Create Synthetic Data from a DataFrame or CSV

Поділитися
Вставка
  • Опубліковано 13 жов 2024

КОМЕНТАРІ • 33

  • @muritalaadebayoisah9155
    @muritalaadebayoisah9155 3 роки тому +5

    Thank you for the wonderful presentation. It was very educative.
    Question1: Can gretel-synthetics be used to create a numerical data frame?
    Question2: What is the minimum number of rows that can be used for training the model in order to generate the synthetic data
    Question3: I tried to use it on a numerical dataset of shape (36,118) but was getting errors during training. I wanted to create a synthetic dataset of shape (200,18) from (36,18). Could you please guide me through. Thank you.

    • @alexanderwatson1786
      @alexanderwatson1786 3 роки тому +2

      Hey Muritala- thank you!
      Re: Question 1- yes, gretel-synthetics should be able to recreate any numeric or text-based data in a Dataframe
      Question 2- The more rows and diversity you have in the data, the better the model will learn. We generally recommend 500+ rows of input data.

  • @notnikhilkanwar
    @notnikhilkanwar Рік тому +2

    Hey! i like your video i just have one question does this generated data have the same distribution like the training data ?

    • @gretel_ai
      @gretel_ai  Рік тому

      The distribution of our synthetic data matches the real world data very closely- you can use the SQS report to compare the real world vs. synthetic data. Cheers
      * gretel.ai/blog/how-accurate-is-my-synthetic-data

  • @edwinfromayazona5214
    @edwinfromayazona5214 3 роки тому +1

    Brilliant
    Can we get the notebook used in this tutorial?

    • @gretel_ai
      @gretel_ai  3 роки тому

      Here you go, let us know how the notebook works for you! github.com/gretelai/gretel-blueprints/blob/main/docs/notebooks/create_synthetic_data_from_a_dataframe_or_csv.ipynb

    • @positive_thinker
      @positive_thinker 5 місяців тому

      @@gretel_ai it works perfectly ,thanks

  • @hinaque4505
    @hinaque4505 3 роки тому +1

    I am really interested in creating synthtic data, what ML basics should I know before trying this?

    • @gretel_ai
      @gretel_ai  3 роки тому +1

      Hey Hina! No ML experience is required to try Gretel, and you can get started building your first model through our web interface if you'd like at console.gretel.cloud.
      If you would like to read up on how Gretel works under the hood, here's a technical blog that's a good start: gretel.ai/blog/using-generative-differentially-private-models-to-build-privacy-enhancing-synthetic-datasets-from-real-data

  • @chandrimadebnath2859
    @chandrimadebnath2859 3 роки тому +1

    sir, i just start my work on synthetic data geneartion. can you help me on like what are the prerequisite needed. it will be a grwat help for me.
    thank you.

    • @gretel_ai
      @gretel_ai  3 роки тому

      Hey Chandrima- here is a link to our latest docs on pre-requisites. All you will need is an environment that can run Python, and you can use the Python command line interface (CLI) or SDKs to build synthetic data. docs.gretel.ai/environment-setup

  • @bthapa94
    @bthapa94 2 роки тому +1

    where can I access the collab code above to play around in my environment?

    • @gretel_ai
      @gretel_ai  2 роки тому

      Here are links to our Colab notebooks for our SDKs and use cases. Good luck! docs.gretel.ai/synthetics/synthetic-sdk-notebooks

  • @mersancanonigo2933
    @mersancanonigo2933 3 роки тому +1

    Thank you sir...

    • @gretel_ai
      @gretel_ai  3 роки тому

      Thanks Mersan! Check out the new version of the video here: ua-cam.com/video/_lKgxrDCxrA/v-deo.html

  • @randomruzan13
    @randomruzan13 2 роки тому

    Can we create synthetic data for parallel aligned sentences?.
    Since it will be a trial to create synthetic data for translation purpose.

    • @gretel_ai
      @gretel_ai  2 роки тому +1

      Yes, you could create two columns (sentence1, sentence2) that you can use for training and synthesis. Good luck and feel free to email us if you have any questions hi (at) gretel.ai

  • @efremyohannes2334
    @efremyohannes2334 3 роки тому

    Is there a way to produce the model performance report for Time Series synthetic data generation as with other examples shown?

    • @gretel_ai
      @gretel_ai  3 роки тому

      You can generate and view the model performance report the same way as with the other examples- e.g.
      import IPython
      from smart_open import open
      IPython.display.HTML(data=open(model.get_artifact_link("report")).read())
      Good luck!

  • @bholobadshahg11
    @bholobadshahg11 3 роки тому

    Hi, I followed the example. However, model.build() generates an error, KeyError: 'checkpoint_dir'

    • @gretel_ai
      @gretel_ai  3 роки тому

      Hey Owais- feel free to connect with our team on Slack (gretel.ai/slackinvite) or send us an email at support@gretel.ai. Also, here's a link to a notebook which is tested with our latest code- github.com/gretelai/gretel-blueprints/tree/main/docs/notebooks Best of luck!

  • @leandroborges7941
    @leandroborges7941 2 роки тому

    When I try to do all this in Jupyter Notebook, it gives error "Failed to validate credentials"

    • @gretel_ai
      @gretel_ai  2 роки тому

      Leandro- you need to generate an API key from console.gretel.cloud and copy that key into your browser. Good luck! If you have any questions, you can reach us on our Slack community at gretel.ai/slackinvite