Some issues if try to use it in a real life :) #37

Open
Denys88 opened this issue Nov 22, 2023 · 1 comment

Comments

@Denys88

Denys88 commented Nov 22, 2023

Hi, I tried to use your product but ran into a lot of small issues and found some missing functionality.

  1. Please don't assume that the git address always looks something like this:


    I tried to use it with an internal GitHub instance, which is different from public GitHub, and got an error.

  2. In a lot of cases your error messages are unhelpful, as in the first example.
    I tried to run higgsfield manually and got a lot of messages like 'something is not a string'.
    A quick debugging session showed that I had either forgotten a command-line parameter or passed a wrong one. This could be improved.

  3. Llama and Hugging Face:

    tokenizer=LlamaTokenizer.from_pretrained("meta-llama/Llama-2-7b-hf"),

    When I import the Llama loader, it automatically tries to access HF without my permission. In general, accessing something on the internet without an explicit call is a big red flag from a security point of view. In my case I had already downloaded everything and didn't need to connect to HF at all.

  4. It would be nice to see more examples:

  • a very simple, manually implemented architecture that supports DeepSpeed/ZeRO distributed training.
  • an example that shows how to run everything manually without GitHub and HF access.
  • the ability to run your code on a single machine: single GPU, and single machine with multiple GPUs too.
    Otherwise, how do you expect people to debug their code?
    I wanted to run a simple example without setting up my machines or using GitHub and found it impossible, which is a big problem in my opinion.
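For point 1, a minimal sketch of parsing a git remote without hardcoding the host might look like the following. `GitRemote` and `parse_git_remote` are hypothetical names used for illustration only, not part of higgsfield, and the hostnames in the usage note are made up.

```python
import re
from typing import NamedTuple
from urllib.parse import urlparse


class GitRemote(NamedTuple):
    host: str
    owner: str
    repo: str


# scp-like SSH syntax: [user@]host:owner/repo(.git)
_SCP_LIKE = re.compile(r"^(?:[^@/]+@)?(?P<host>[^:/]+):(?P<path>[^/].*)$")


def parse_git_remote(url: str) -> GitRemote:
    """Split a git remote URL into host, owner, and repo without assuming github.com."""
    url = url.strip()
    if "://" in url:
        # https://host/owner/repo.git or ssh://git@host:port/owner/repo.git
        parsed = urlparse(url)
        host, path = parsed.hostname, parsed.path
    else:
        match = _SCP_LIKE.match(url)
        if match is None:
            raise ValueError(f"unrecognized git remote URL: {url!r}")
        host, path = match["host"], match["path"]

    parts = [p for p in path.strip("/").split("/") if p]
    if not host or len(parts) < 2:
        raise ValueError(
            f"expected <host>/<owner>/<repo> in git remote URL, got {url!r}"
        )

    repo = parts[-1]
    if repo.endswith(".git"):
        repo = repo[: -len(".git")]
    # Everything before the repo name is treated as the owner (may be nested).
    owner = "/".join(parts[:-1])
    return GitRemote(host, owner, repo)
```

With this, `parse_git_remote("git@github.example.com:team/project.git")` and `parse_git_remote("https://github.com/owner/repo")` both go through the same path, and an unparseable address fails with a message that names the offending value.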

Overall, great job and a nice implementation, but it could be much more user-friendly.
Thanks!
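For point 3, one possible shape for an explicit, offline-only load is sketched below. `Lazy` and `load_local_llama_tokenizer` are hypothetical helpers and the model path is a placeholder; the only library call assumed is `LlamaTokenizer.from_pretrained` with its `local_files_only` argument. This is not how higgsfield currently loads the tokenizer.

```python
from typing import Callable, Generic, Optional, TypeVar

T = TypeVar("T")


class Lazy(Generic[T]):
    """Hold a loader and run it only on the first explicit .get() call."""

    def __init__(self, loader: Callable[[], T]) -> None:
        self._loader = loader
        self._value: Optional[T] = None
        self._loaded = False

    def get(self) -> T:
        if not self._loaded:
            self._value = self._loader()
            self._loaded = True
        return self._value  # type: ignore[return-value]


def load_local_llama_tokenizer(model_dir: str):
    """Load a tokenizer from an already-downloaded directory, never from the network."""
    # Imported inside the function so that importing this module has no side effects.
    from transformers import LlamaTokenizer

    # local_files_only=True restricts transformers to files already on disk.
    return LlamaTokenizer.from_pretrained(model_dir, local_files_only=True)


# Placeholder path (assumption). Nothing is read or downloaded on import;
# the tokenizer is loaded only when tokenizer.get() is called.
tokenizer = Lazy(lambda: load_local_llama_tokenizer("/path/to/Llama-2-7b-hf"))
```

Setting the `HF_HUB_OFFLINE=1` environment variable before starting the process is another way to rule out network access.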

@arpanetus
Collaborator

Hey! Thanks a lot for your thorough inspection.

  1. Nice catch. We hadn't thought about it.
  2. Yes, the errors are ill-defined for now. The "something is not a string" case happens because we parse the AST directly without any proper analysis (line numbers, etc.).
  3. We'll change the logic soon, so you can use locally downloaded models and datasets with higgsfield without any implicit calls.
  4. Overall, we are going to provide more tutorials. We're quite sorry for not having them right now.

    a very simple, manually implemented architecture that supports DeepSpeed/ZeRO distributed training.

    • Right now our API provides support for major LLMs. You can implement your own if you want to.

    example which shows how to manually run everything without github and hf access.

    • We chose GitHub only for usability. It's not a big deal to make a converter for GitLab or other GitHub-like services that have a concept of CI/CD. Also, you don't have to depend on Hugging Face when it comes to datasets or models. The current implementation provides a way to do so.

    ability to run your code on a single machine + single gpu and single machine + multiple gpu.

    • We'll make it happen in upcoming updates.
      Appreciate it very much.
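To illustrate point 2: nodes produced by Python's `ast` module carry `lineno` and `col_offset`, so a type check on a parsed argument can report where the problem is. The sketch below is not higgsfield's actual parser; `ConfigError`, `string_keyword`, and the `<config>` filename are hypothetical, and `ast.unparse` requires Python 3.9 or newer.

```python
import ast


class ConfigError(ValueError):
    """Raised with a file position when a parsed argument is missing or mistyped."""


def string_keyword(call: ast.Call, name: str, filename: str = "<config>") -> str:
    """Return keyword argument `name` of `call` as a str, or fail with a located message."""
    for kw in call.keywords:
        if kw.arg != name:
            continue
        node = kw.value
        if isinstance(node, ast.Constant) and isinstance(node.value, str):
            return node.value
        # Report file, line, and 1-based column of the offending value.
        raise ConfigError(
            f"{filename}:{node.lineno}:{node.col_offset + 1}: "
            f"argument {name!r} must be a string literal, got {ast.unparse(node)}"
        )
    raise ConfigError(f"{filename}:{call.lineno}: missing required argument {name!r}")


# Usage: the value of `name` on line 2 is an int, not a string.
source = "run(\n    name=42,\n)"
call = ast.parse(source).body[0].value
try:
    string_keyword(call, "name")
except ConfigError as err:
    print(err)
```

Instead of a bare "something is not a string", the message names the argument, shows the value that was found, and points at the line and column it came from.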
