this post was submitted on 11 Sep 2023
7 points (70.6% liked)

Autism

6867 readers
4 users here now

A community for respectful discussion and memes related to autism acceptance. All neurotypes are welcome.

We have created our own instance! Visit Autism Place the following community for more info.

Community:

Values

  • Acceptance
  • Openness
  • Understanding
  • Equality
  • Reciprocity
  • Mutuality
  • Love

Rules

  1. No abusive, derogatory, or offensive post/comments e.g: racism, sexism, religious hatred, homophobia, gatekeeping, trolling.
  2. Posts must be related to autism, off-topic discussions happen in the matrix chat.
  3. Your posts must include a text body. It doesn't have to be long, it just needs to be descriptive.
  4. Do not request donations.
  5. Be respectful in discussions.
  6. Do not post misinformation.
  7. Mark NSFW content accordingly.
  8. Do not promote Autism Speaks.
  9. General Lemmy World rules.

Encouraged

  1. Open acceptance of all autism levels as a respectable neurotype.
  2. Funny memes.
  3. Respectful venting.
  4. Describe posts of pictures/memes using text in the body for our visually impaired users.
  5. Welcoming and accepting attitudes.
  6. Questions regarding autism.
  7. Questions on confusing situations.
  8. Seeking and sharing support.
  9. Engagement in our community's values.
  10. Expressing a difference of opinion without directly insulting another user.
  11. Please report questionable posts and let the mods deal with it. Chat Room
  • We have a chat room! Want to engage in dialogue? Come join us at the community's Matrix Chat.

.

Helpful Resources

founded 1 year ago
MODERATORS
 

Hey everyone, I've been searching for a bit on getting local LLM inference to process legal paperwork (I am not a lawyer, I just have trouble through large documents to figure out my rights). This would help me have conversations with my landlord and various other people who will withhold crucial information such as your rights during a unit inspection or accuse you of things you did not etc.

Given that there are 1000s of pre-trained models, would it be better to train a small model myself on an RTX 4090 or a Daisy chain of other GPUs? Is there a legal archive somewhere that I'm just not seeing or where should I direct my energy? I think lots of us could benefit from a pocket law reference that can serve as an aid to see what to do next.

top 8 comments
sorted by: hot top controversial new old
[–] [email protected] 9 points 1 year ago* (last edited 1 year ago) (2 children)

Generally, training an llm is a bad way to provide it with information. “In-context learning” is probably what you’re looking for. Basically just pasting relevant info and documents into your prompt.

You might try fine tuning an existing model on a large dataset of legalese, but then it’ll be more likely to generate responses that sound like legalese, which defeats the purpose

TL;DR Use in context learning to provide information to an LLM Use training and fine tuning to change how the language the llm generates sounds.

[–] [email protected] 2 points 1 year ago (1 children)

I know nothing about “in context learning” or legal stuff, but intuitively, don’t legal documents tend to reference each other, especially the more complicated ones? If so, how would you apply in context learning if you’re not aware which ones may be relevant?

[–] [email protected] 5 points 1 year ago* (last edited 1 year ago) (1 children)

Yes, you can craft your prompt in such a way that if the llm doesn’t know about a referenced legal document it will ask for it, so you can then paste the relevant section of that document into the prompt to provide it with that information.

I’d encourage you to look up some info on prompting LLMs and LLM context.

They’re powerful tools, so it’s good to really learn how to use them, especially for important applications like legalese translators and rent negotiators.

[–] [email protected] 1 points 1 year ago (1 children)

thanks for your answer! Is this same or different from indexing to provide context? I saw some people ingesting large corpus of documents/structured data, like with LlamaIndex. Is it an alternative way to provide context or similar?

[–] [email protected] 2 points 1 year ago

Indexing and tools like llamaindex use LLM generated embeddings to “intelligently” search for similar documents to a search query.

Those documents are usually fed into an LLM as part of the prompt (eg. context)

[–] [email protected] 2 points 1 year ago

I'll read more into "in context learning" and see if I can figure out something useful from the vast corpora of datasets out there.

I guess I can't relegate my thinking entirely to a mathematically optimized black box, but one can hope that it could help point me in the direction to understand my rights in my housing complex.

[–] [email protected] 1 points 1 year ago* (last edited 1 year ago) (1 children)

When you find one or successfully train one, I’d love to know as well. Maybe you can crosspost this on?

I saw this dataset on HuggingFace, does it fit your use case? https://huggingface.co/datasets/lexlms/lex_files

[–] [email protected] 2 points 1 year ago

Hi there! Looks like you linked to a Lemmy community using a URL instead of its name, which doesn't work well for people on different instances. Try fixing it like this: [email protected], [email protected]