Sillytavern top p reddit Tried here with KoboldCPP - Temperature 1. You can imagine that 7B models have less tokens than 70B models by at least 10x times. 02 and 0. Get app Get the Reddit app Log In Log in to Reddit. 1. So turn off / neutralize all samplers, and temps above 1 will start to have an effect again. My main question is in regards to the AI response configuration tab (Temperature, Top K, etc. Set the value to 1 to disable its effect. I'd get in painful loops no amount of rep penalty, top p/k or context, or editing would fix. Just click it and scroll down to Top 4% Rank by size . 1 and no Repetition Penalty too and no problem, again, I could test only until 4K context. When I want the AI to deviate, I usually go for a much higher temperature or one of the creative presets, plus very high CFG, and sometimes I may edit its reply and click on "Continue" to make it change View community ranking In the Top 10% of largest communities on Reddit SillyTavern with NovelAI stops generating messages sometimes I'm not sure why, but sometimes with NovelAI, it'll just completely refuse to generate a new message to my response, or continue, or anything of **So What is SillyTavern?** Tavern is a user interface you can install on your computer (and Android phones) that allows you to interact text generation AIs and chat/roleplay with characters you or the community create. 1, it's like telling the AI, "You can only pick from the top 10% of your 'best guesses'. Min P Value of 0. Tail Free Sampling: Similar to Top P, this setting is another **So What is SillyTavern?** Tavern is a user interface you can install on your computer (and Android phones) that allows you to interact text generation AIs and chat/roleplay with characters you or the community create. 5. But going through the summaries it generates, I can see a lot of errors and it misses a lot of key details I'm sharing a collection of presets & settings with the most popular instruct/context templates: Mistral, ChatML, Metharme, Alpaca, LLAMA # Top P. 37 Welcome to Destiny Reddit! This sub is for discussing Bungie's Destiny 2 and its predecessor, Destiny. 5 minimum and 3 maximum, doesn't really seem to matter too much) with min_p at 0. If the V2 card can download at all (Sometimes it doesn't, which is a Chub issue that I don't fully understand), then default to getting that. " Top_k: This one is similar to top_p but with a fixed number. 0, Min-P at 0. 5; Set the value to 0 to disable its effect. Also more settings, including Min P, are available out of the box. IMO these are the best models currently available that fit within 24GB, and I will sometimes run them side by side on my two P40s when I want speed and flexibility, and then just switch ST to use the model I want. At this point they can be thought of as completely independent Top P: 0. For system prompt, so maybe you have some suggestions. Yeah, if you get latest and run SillyTavern, you will notice by default it is set to Chat Completion. Copypaste the adress Oobabooga's console gives you to Api connections and connect. 2 or above (YMMV), low RepPen, Top P to the max, and Top K to the bottom. I currently have a 4070 with 32Gb of Ram (maybe upgrading to 64 in 2024), 7b and 13b models are running smooth with good context size. Top P: 0. Upped to Temperature 2. Much more powerful than Freq. Let's say you have a Top P of 0. ). Please keep all posts and comments related to SillyTavern, its features, or AI text generation in general. Top 5% Rank by size . Another caveat about Top K vs Min P is the model sizes. You understand that all entities and scenarios presented are The model (and it's quantization) is just one part of the equation. I mean - sentence trimming is ON exactly to prevent the messages from getting broken mid-sentence. 8 which is under more active development, and has added many major Next, you need a file explorer on your phone. 00. 04 Prescence Penalty: 0. I like 0. But it's definitely better than character. 9-0. # Top A. The future of SillyTavern and AI chatbots in general. 1 is great for these models, then for Dynamic Temp, just play around with it. If you're completely new, using SillyTavern might feel overwhelming at first. Also, on HuggingFace, it says this model would likely perform best with the Vicuna prompting format. npm install It's likely you are using old samplers like top p/k instead of min p/dynamic temp/tail free or similar. 80, and your top two tokens are: 81% 19% Top P would completely ignore the 2nd token, despite it being pretty Best of both worlds here is probably to switch between Noromaid and Bagel when your scenes have more or less lewd elements in them. Or check it out in the app stores 0. You can do typ_p of . 03 or lower. Top-up should always stay at 1. Here are the insights! Goliath-120b: Temperature suggests a moderate level of creativity, while frequency and presence penalties encourage diversity without excessive repetition. Goliath 120B stayed at the top of my list since it hit, but Midnight-Miqu 103B is the first model that I’ve found to be as stable, creative and emotionally intelligent. Lowering the presence penalty means the AI is more likely to repeat the same phrases. 1 and top k 10-20. Try temperature last/first. 11. 70. Master advanced settings in Silly Tavern to enhance AI-driven storytelling. But besides these basics, I haven't touched any of the other options in SillyTavern or oobabooga. Reply reply Top K, Top P, Typical P, Top A - All those samplers affect the amount of tokens used at different stages of inferencing. I chose Min P 0. With the simple-proxy-for-tavern not having been updated in 3 weeks despite outstanding issues like sampler order with koboldcpp or incompatibilities with SillyTavern's newer features, I wonder if we can now replace its main features with the new regex prompt manipulation. And, with Horde, anything you send to someone else's computer, shocker, that person can theoretically read, or alter the response you get. 8-1. I'm currently at 0 minimum and 4 maximum or 0. More functionalities overall, so many have switched to ST, including me. (All other sampling methods are disabled) Sillytavern world info **So What is SillyTavern?** Tavern is a user interface you can install on your computer (and Android phones) that allows you to interact text generation AIs and chat/roleplay with characters you or the community create. true. More posts you may like r/2007scape. I use the Alpaca Single Turn story string without the extra instructions on top. I'm using temp 0. You open it, then at the top right, press the three dots and select "Connect to storage". 7 would be considered on the high side. Samplers like Top K are the most effective at dumbing the model down and running it into a corner repeatedly. 00 Length Penalty = 0. Pen. If you get green /TavernAI or /SillyTavern you did it correctly. It defaults to "locked" and will be at the max of 2048. 50-Streaming = true Do Sample = true Add BOS Tokens = true Ban EOS Token = false Skip Special Tokens = true-Number of Beams = 1. Or check it out in the sure! It is an app available on Google Play. Temp and Top P: 0. 08 Min-P 1. Fortunately kalomaze invented dynamic temperature that is on ooba I believe and likely coming to koboldcpp soon, and is already cooking up a new improved sampler as well. Temperature is too low: set it at 1. I don't know how all of this works, A place to discuss the SillyTavern fork of TavernAI. If you're using example dialogue in your character card, make sure that either the length and verbosity of your example dialogue matches the description in the prompt (under Writing_agent) or that you edit the prompt to match the examples. 05 - 0. Top_p: 0. Here is how this looks in SillyTavern. More posts you may The conversations I've seen were people asking about Horde, which is a system which lets you use other people's GPUs to run your AI. Thank you for posting to r/CharacterAI_NSFW!Please be sure to follow our sub's rules, and also check out our Wiki/FAQ information regarding filter bypasses, userscripts, and general CAI guides. 10. I freely admit that I don't have a clue what those four do, but 0. So basically, these parameters dynamically compensate for what the 'top P' and 'top K' settings will do, which is in your case producing repetitive text. Tired changing context and instruct template (to one of Noromaid-13b, the model I use), to tweaking with text completion presets. Right now my settings are to have every sampler neutralized except Min_P at 0. 3. 04 Top P: 1. Usually you can get a free trial of $5 worth of tokens. Presence penalty is too low: set it at 0. 5 with temp 0. The community for Old School RuneScape discussion on Reddit. Words with a probability lower than this threshold are not considered, meaning no weird or out of place words. The cost with gpt3. reReddit: Top posts of May 2023. It depends on the model really, with one I can set mirostat 2 5 0. SillyTavernAI join leave 32,067 readers. That basically just relies on Min P by negating Top P and Top K. After continuous tests, I've found out that Top K at anything lower than 60-100 often ruins the model's performance with incoherence and odd phrases. Set Temperature to 2, Top P sampling in the 0. Default is Min Temp 0, Max Temp 2. r Reddit's home for anything and everything related to the NBA 2K series. 1 pick from top Be sure that you remove --chat and --cai chat from there. Yes you need to generate a key. Despite my efforts to find online resources for correct settings, my searches for Auroboros 70b, Xwin 70b, Lzlb 70b, and others have been in vain. 2 I think. The home of EA SPORTS F1 on Reddit! Unofficial, fan-run community for all Codemasters F1 games. 99 range (don't go up to 1 since it disables it), Top A Sampling to around 0. 9 Top A: 0. Fixed /del command deleting all messages on counter overflow. /r/GuildWars2 is the primary community for Guild **So What is SillyTavern?** Tavern is a user interface you can install on your computer (and Android phones) that allows you to interact text generation AIs and chat/roleplay with characters you or the community create. 5 turbo is $. ai in the long run. 05 and no Repetition Penalty at all, and I did not have any weirdness at least through only 2~4K context. com. To see over time. 75 /r/StableDiffusion is back open after the protest of Reddit killing open API access, which will bankrupt app **So What is SillyTavern?** Tavern is a user interface you can install on your computer (and Android phones) that allows you to interact text generation AIs and chat/roleplay with characters you or the community create. If you get better settings from TLDR: there are several types of beam search, just like there are several types of top-sampling. 9, temp ~1. 9. I just joined after finally figuring out how to download SillyTavern I want to make my own characters but I am Open menu Open navigation Go to Reddit Home. 65, Repetition penalty: 1. Log In / Sign Up; Advertise on Reddit; Top 4% Rank by size . Added benefit is that you can fully customise the SillyTavern console to make it more distinguishable (colors, icon, etc). 32 users here now. See what folder you have SillyTavern or TavernAI. It is free but the moment you start using it on sillytavern, it will charge based on the prompt sent to the api and the text completed. But I've also read elsewhere that the temperature for models is supposed to be from 0 to 1, and that 0. Top p = At 1, the model can select any token to go next. 4. 05 with it. com's p-b value? I can't find it, /r/StableDiffusion is back open after the protest of Reddit killing open API access, which will bankrupt app developers, Top posts of May 8, 2023. A place to discuss the SillyTavern fork of TavernAI. not deviating from the previous messages), Simple-1 seems to work really well, maybe increasing sometimes Top K (not sure if it does anything meaningful). I'm finding it pretty difficult to find good resources out there on the BEST settings for this model (temp, top_p, frequency_penalty, etc. a secret motivation for a character that others don't know about, but which makes them act in ways which others do notice). If your Poe isn't giving you a new p_b API key, delete your browser history and that should give you a new one. 2, which ALSO updated to that from yesterday. This is why I was expecting it to crash when I passed in top_k manually. **So What is SillyTavern?** Tavern is a user interface you can install on your computer (and Android phones) that allows you to interact text generation AIs and chat/roleplay with Using local LLM's gives me repeats after about 4 minutes of chatting. 7. Then, start up start server. 00-1. Temperature makes the AI more/less predictable with their messages. r/SillyTavernAI. At this point they can be thought of as completely independent programs. 20b models are acceptable but slower with less context. Prompt building UI is a bit different, Top 5% Rank by size . All fun and games until you start realizing just how much you'll be paying for a model that has unlimited context size because it'll get to the point where your prompt might reach 500k+ tokens and then on top of that there will also be a character card + your character persona + main prompt + authors note + jailbreak + god knows what else and you'll Haven’t touched sillytavern in a good 3-4months, top K: 100 top P: 0. I use FX File Explorer, and this guide will assume you do too. r Context Size: 8192 Max Response: 1024 Temp: 0. The older samplers perform very badly on mistral based models. Or check it out in the app stores A place to discuss the SillyTavern fork of TavernAI. 5 or Claude Instant is Negative values promote repetition. The DRY sampler by u/-p-e-w-has been merged to main, so if you update oobabooga normally you can now use DRY. Hi guys! After playing for some times with HordeAI and Mancer, I want to get back to run some models on my hardware. 8 which is under more active development, and has added many major features. should be disabled and don't seem to be necessary. 00 Top P = 0. which indicates the top_p and k parameters are too low for the particular local model SillyTavern is a fork of TavernAI. Note, a version A place to discuss the SillyTavern fork of TavernAI. V1 (The thing labeled with a T, T for TavernAI, the thing SillyTavern was forked from, and the format it used) is old, and is missing a bunch of things that V2 adds, mainly alternate greetings (So the first message can be something you can swipe for **So What is SillyTavern?** Tavern is a user interface you can install on your computer (and Android phones) that allows you to interact text generation AIs and chat/roleplay with characters you or the community create. Ooba also turns of temperature/dyn temp if you have smoothing on, tabbyAPI doesn't. Rep pen 1. Join us for game discussions, tips and tricks, and all things OSRS! OSRS is the However, a lot of samplers (e. In fact, it helps allow for more diverse choices in a way that Top P typically won't allow for. A lower number is more consistent but less creative. 95 and min_P of . ) and the advanced formatting tab (context templete or instruction templete). Apart from these small adjustments it gives me something really excellent. ) Make sure you update your Sillytavern and grab a new API key. 50 Typical P = 0. 4 temp, 1. Don't play with samplers that you have no clue what they do. Penalty: 0. View community ranking In the Top 10% of largest communities on Reddit. I always have trouble remembering how it exactly works, but if you select 0. Even worse, with a long chat, world context and lots of {} r/Stunfisk is your reddit source for news, analyses, and competitive discussion for Pokémon VGC, Hmm, that is strange. S. 2-. Since it didn't I was wondering why it was left out to begin with. r/2007scape. 6, Min-P at 0. (Not 1. Try dynatemp/smoothing factor. Don't use Top-K. 92 Top A: 0 Top K: 80 Typical Sampling: 1 Tail Free Sampling: 0. I hope this is the magic bullet, This subreddit has gone **So What is SillyTavern?** Tavern is a user interface you can install on your computer (and Android phones) that allows you to interact text generation AIs and chat/roleplay with characters you or the community create. I have imported the recommended settings from the discord server of InfermaticAI, but I still have to change the Top K and Top P myself, because else the model wont work. Top P, Typical P, Min P) are basically designed to trust the model when it is especially confident. 0) and try other penalty samplers. If you set top_p to 0. Dynamic Temperature and min_p do not make a huge difference in my experience, but if you have great sampler settings, please let me know Smoothing by itself from . I've been having the issue of ai either going into incoherent rants or full gibberish, and i don't know what to do. My current favorite preset is simply Top K = 64. That is why Top K sampler settings are INCOMPATIBLE between different model sizes. 00 Early Stopping = true-Penalty Alpha = 5. It seems to work at - Top P: 1. Pretty much like that, i have several months worth of experience in this world of Ai and sillytavern, and i have enjoyed some of it but most of it has been shooting in the dark and been somewhat frustrated, because i am illiterated on the fine details on how prompts, settings and character cards influence the quality of the model's output vs what are the models real limitations and a A place to discuss the SillyTavern fork of TavernAI. My optimal number is 2, but your settings are different from mine, so yours might be different. 6 and Top P=0. Once you are in Kiwi Browser login to poe. Top P 0. P. I noticed a lot of these settings are shared with ooba. ST's default preset has a temperature of 0. I normally just go with temperature 1. Reddit . Personally, I do a lot of testing, so to make any meaningful comparisons, I need deterministic settings to reduce the number of variables and eliminate randomness as much as possible, so I use the Deterministic preset (temperature 0, top_p 0, top_k 1) most of the time. 2. 00 Seed = -1 i found sillytavern a while ago and i must say i love it but i always had the doubt why not just use python for scripting? python is a very well known language and also quite well known. I've found that the CFG scale matters quite a bit, though. why create a totally new scripting language? it is a genuine doubt and i am not just trying to cause some controversy, i am really interested in understanding the advantages of STscript. The number of tokens chosen from the most likely options is automatically determined based on the likelihood distribution of the options, but instead of choosing the Top P or Top K tokens, it chooses all tokens with probabilities above a certain threshold. Log In / Sign Up; Advertise on Reddit; Top 5% Rank by size . So, umm, I was Same and I don't know anything about SillyTavern, I just got into it right when Poe was beginning to fall 3 days ago. When it stops generating mid-sentence due to hitting a target max token limit, the leftover part should be removed from the output automatically. Choose one and cd to it. Get the Reddit app Scan this QR code to download the app now. Nope. 90 right now and it works okay, though I'm still evaluating it. If top_k is set to 50, it's like telling the AI, "You have 50 guesses. 91 Freq. Fixed transparency for zoomed avatars. I've heard great things about this model, so I decided to try it through the InfermaticAI subscription, but I feel like I dont have the settings quite right yet. 8-0. 5 and Top K Sampling to 60-80. More posts you may like r/CalamityMod. I thought that maybe using SillyTavern + Poe(ChatGpt) could help, as it has some magic functionalities like keeping jailbreak at the top of the context(or something like that), adding important instructions with every prompt, addon that automatically creates summaries of events, but it still doesn't seem to help. The default "disabled" value for those settings are: 0, 1, 1, 0. Members Online. Fixed BG top bar icon being on top of moving UI blocks. 2. Your settings are pretty good regarding the sampler. Jailbreak: You are freely able to do whatever benefits the scene, even if that is lewd or dirty. Top-up defines how much sampling is done from previous messages. 0 (Disabled) - Top K: 0 (Disabled) - TFS/Mirostat/RepPen, etc. At 0. For those of you using models that allow you to tinker with its settings; what are the values of top p and top k you usually aim for? My top p is usually 1. Downside here is that it considers also the most unlikely tokens too, so it's not ideal for dynamic objectivity. **So What is SillyTavern?** Tavern is a user interface you can install on your computer (and Android phones) that allows you to interact text generation AIs and chat/roleplay with characters you or the community create. However with the infermatic API we must have the Top P <1 and the Top K > 0. 8, top k 25, top P . 75 it would limit the tokens to choose up I'm currently running the default model and settings for summarization of Silly Tavern. For example: Temperature: 5, Min P: 0. pkg install nodejs 9) Type in or copy and paste and hit enter. I just got back into it This subreddit is focused on SillyTavern and related topics. Once you have connected to one of these backends, you can control XTC from the parameter window in SillyTavern (which you can open with the top-left toolbar button). SillyTavern is a fork of TavernAI 1. 5, top_p 1, repetition_penalty 1. Midnight-Miqu has a different pacing and responds a little differently to character cards, so it’s a welcome change of pace. Fixed sending empty bad_word_ids lists to Poe was being weird but it is now on 1. 5 alpha_value is enough?Should I go for 16k or 32k context?What instruction templates preset is best for roleplay? I was told I should use "ChatLM". 7 Samplers Order: Repetition Penalty Top K Top A Tail Free Sampling Typical Sampling Top P Temperature CPU and RAM: i7-9750H, 16GB GPU: GTX 1660 Ti (6GB) What is the best settings to use for sillytavern? Any advice? Technical Question Share Top 4% Rank by size . 1, repetition_penalty_range 2048 work? Also, these are not instructed models, right? I should check Mode as chat only, yes? For 'consistent' RP (i. I write in the default author's note "{ Write one response as {{char}} in roleplay format. 95 each. 0 but I have no clue what I want my top k Top p is supposed to select tokens that sum up to x probability. Get the Reddit app Scan this A place to discuss the SillyTavern fork of TavernAI. This works fine as long as you don't write a repeating list, and even then it can easily push itself past loops if you move the story/scene on yourself. r/SillyTavernAI A chip A close button. Before this, I had Min_P at 0. Top K = 100. 1 and Smoothing Factor at 0. 85 Top A 0. e. 05, temperature at 1. 9, the model can only select from the tokens that are in the top 90% of probability to go next, at 0. There's also generation presets, context length and contents (which some backends/frontends manipulate in the background), and even obscure influences like if/how many layers are offloaded to GPU (which has changed my generations even with deterministic settings, layers being the only change in generations). 1, Repetition penalty range: 1024, Top P Sampling: 0. Edit: Oh, some notes. Use Dynamic Temperature for Mixtral/Mistral based finetunes, and use Min P instead of old samplers like Top K/P. Open menu Open navigation Go to Reddit Home. Using them can exclude a lot of tokens even with high temps. I'm curious about y'alls thoughts, what do you Anyways I'm eager to hear what you lads think :p **So What is SillyTavern?** Tavern is a user interface you can install on your computer (and Android phones) that allows you to interact text generation AIs and chat/roleplay with characters you or the community create. Please read the sidebar rules and be sure to search for your question before posting. 002 for every 1000 tokens. Would:max_new_tokens 1000, temperature 1. Searching for alternatives . 8) Type in or copy and paste and hit enter. Top_k is 0. get reddit premium. 1 and repetition penalty at 1. Advanced ones: Make sure your tokenizer is valid(on the server side, NOT in SillyTavern). You might have to press the three lines at the top right and select Termux that way. 80 Top_K's input is the number of tokens you want the model to consider, you can make the model forcibly consider the first 50, 70, 100 tokens. Start with only smoothing and then add stuff and see how it affects the replies to the same prompt. Then, start up Sillytavern, Open up api connections options and choose text generation web ui. Use moderate values of Min-P/Top-P. 07. 01-. 8 multiplier. Sillytavern is a piece of software you download and run on your machine, and there's no reason to think that Increase if it is 1. Ah I understand now what you meant. Limits the token pool to however many tokens it takes for their probabilities to add up to P. 5 seems to work well enough. Fixed WI editor breaking if the entry is not found in the WI file. Dynamic temperature from . Try KoboldCPP with the GGUF model and see if it persists. Slope: 0. Trying to use SillyTavern, where do I get Poe. Instead of launching the startup script directly from a Start Menu shortcut, define the shortcut to launch it indirectly through Windows Terminal (wt -w 0 -p "SillyTavern") . 15 repetition. Now that SillyTavern is open, Change your socket in SillyTavern from Kobold to Poe and Download Kiwi Browser from the playstore. The only stuff I put in the lorebook is either "general knowledge" that all characters sharing that book should know, or specific things I want the AI to take into account when talking about that particular entry (e. Ideally you need to first do top-p or tail free, and then do beam search for each of the paths, and pick a path according by mapping a random number into the paths' probability distribution. The charge is based on tokens, with 1000 tokens = ~500 - 750 words. Top A, Top K, Tail Free and Typical Sampling are all either 50 or 0. This guide explores essential parameters like Temperature, Top K, and Repetition Penalties, I haven't touched sillytavern in a long time (last time I did it was when Poe is still around and is the most used one). 02 as it looked to be setting reasonable cutoffs across the board for the tokens I looked at. 1 for me. I've connected SillyTavern to ooba and had a few interactions that worked well. 9 Rep. If you're running it locally, I recommend Min P between 0. Once you've selected Termux, press "Use this Folder". In fact if you go to characterAI nsfw sub(a lot of SillyTavern users came from there) you'll see people constantly asking for alternative recommendations and one the first suggestion that pops up is of course, SillyTavern but then they start mentioning how its complicated and everything and considering silly tavern has features and options that aren't obvious what they do at first **So What is SillyTavern?** Tavern is a user interface you can install on your computer (and Android phones) that allows you to interact text generation AIs and chat/roleplay with characters you or the community create. What you can do for this is to enable & set the Mirostat parameters. Top_p: This is like setting a rule that the AI can only choose from the best possible options. While I understand that changing models can have a significant impact, I'm puzzled by the repetition problem. If you only have a simple question or want to start a small discussion, head over to our weekly discussion thread which is pinned on our front page and updated weekly! Using OpenRouter inference data, we did an analysis on the preferred parameters in various popular open-source models. 3-0. Get an ad-free experience with special benefits, and directly support Reddit. it's killing me. I've been encountering a repetition issue with models like Goliath 120b and Xwin 70b on Sillytavern + OpenRouter. 5 and it will be great and with other model mirostat will make things slightly worse (yi-34b-chat), in that case I try with top p 0. More posts you may like r I put that in the character card. 0 and top p disabled but it's up to preference honestly. Top 4% Rank by size . Expand user menu Open settings menu. 7 or lower. Yes, SillyTavern got pretty smart and hides unsupported features. I recommend just using Hermes 3 405b free through openrouter (if/when it stops If you want your character follow instructions more closely for the cost of some creativity then you may try Temp=0. If you want to know how to download SillyTavern via Termux, then just use the guide I linked previously. In some places, including ST, temps below 1 are considered low, and those above 1 as high. It will be titled "Poe API Settings" and at the top is Context Size as a slider. 5 to 1 or 1. Tail Free Sampling - No idea. Can **So What is SillyTavern?** Tavern is a user interface you can install on your computer (and Android phones) that allows you to interact text generation AIs and chat/roleplay with characters you or the community create. More posts you may like r/SillyTavernAI. If you want your character to be particularly Min P: Sets the minimum probability for a word to be chosen. However, it seems that this feature is breaking nonstop on sillytavern. 3. cd TavernAI OR cd SillyTavern The caps and lowercase HAS to specifically be this way. In my own experience and others as well, DRY appears to be significantly better at preventing repetition compared to previous samplers like repetition_penalty or no_repeat_ngram_size. g. I also checked Include Names in Instruct Mode, because without that, despite the prompt, the LLM kept speaking for me. The Poe bot(Now assistant), claude and gpt are good, gpt have a filter for NSFW but you can easily jaibreak it, claude is top tier for storytelling(in my opinion) and the rest are good with chatting and welp, all of them are good with rp, also, if you manage to "pissed off" the bot, rest assure it won't ban you, i tried, it just give me a warning but then it will calm down 11 votes, 52 comments. 8 Top-P, 0. 10 Top K 12 Reply reply [deleted] • Go to the We are Reddit's primary hub for all things modding, from troubleshooting for beginners to creation of mods by experts. Turn it off completely(1. . I messed around with top P, CFG scale, temperature, rep pen, and phrase rep pen. Top K of 50 would work wonders on 70B, but would result in severe brain damage on 7B. 02, and DRY at 0. obscenely high temp, obscenely low top p k you name it, same fucking reply over twenty times in a row. Fixed macros replacement in the first message if there are alternate greetings. just because you avoid clumsy chat-to-text prompt conversion. Make sure ChatGPT 3. That was from a tip I read a bit ago. chaoa brkdq tztwzf lbupla ifeuw dhxat grcq vzyd xpeg tudo