phorge-phorge

mirror of https://we.phorge.it/source/phorge.git synced 2024-11-24 07:42:40 +01:00

Author	SHA1	Message	Date
epriestley	c0b8e4784b	Add a basic, general-purpose export workflow for all objects with SearchEngine support Summary: Depends on D18918. Ref T13046. Ref T5954. Pull logs can currently be browsed in the web UI, but this isn't very powerful, especially if you have thousands of them. Allow SearchEngine implementations to define exportable fields so that users can "Use Results > Export Data" on any query. In particular, they can use this workflow to download a file with pull logs. In the future, this can replace the existing "Export to Excel" feature in Maniphest. For now, we hard-code JSON as the only supported datatype and don't actually make any effort to format the data properly, but this leaves room to add more exporters (CSV, Excel) and data type awareness (integer casting, date formatting, etc) in the future. For sufficiently large result sets, this will probably time out. At some point, I'll make this use the job queue (like bulk editing) when the export is "large" (affects more than 1K rows?). Test Plan: Downloaded pull logs in JSON format. Reviewers: amckinley Reviewed By: amckinley Maniphest Tasks: T13046, T5954 Differential Revision: https://secure.phabricator.com/D18919	2018-01-26 11:15:59 -08:00
epriestley	2914613444	Fix failure to record `pullerPHID` in repository pull logs Summary: See PHI305. Ref T13046. The SSH workflows currently extend `PhabricatorManagementWorkflow` to benefit from sharing all the standard argument parsing code. Sharing the parsing code is good, but it also means they inherit a `getViewer()` method which returns the ommnipotent viewer. This is appropriate for everything else which extends `ManagementWorkflow` (like `bin/storage`, `bin/auth`, etc.) but not appropriate for SSH workflows, which have a real user. This caused a bug with the pull logs where `pullerPHID` was not recorded properly. We used `$this->getViewer()->getPHID()` but the correct code was `$this->getUser()->getPHID()`. To harden this against future mistakes: - Don't extend `ManagementWorkflow`. Extend `PhutilArgumentWorkflow` instead. We only want the argument parsing code. - Rename `get/setUser()` to `get/setSSHUser()` to make them explicit. Then, fix the pull log bug by calling `getSSHUser()` instead of `getViewer()`. Test Plan: - Pulled and pushed to a repository over SSH. - Grepped all the SSH stuff for the altered symbols. - Saw pulls record a valid `pullerPHID` in the pull log. - Used `echo {} \| ssh ... conduit conduit.ping` to test conduit over SSH. Reviewers: amckinley Reviewed By: amckinley Maniphest Tasks: T13046 Differential Revision: https://secure.phabricator.com/D18912	2018-01-23 14:09:42 -08:00
epriestley	6b99aac49d	Digest changeset anchors into purely alphanumeric strings Summary: Ref T13045. See that task for discussion. This replaces `digestForIndex()` with a "clever" algorithm in `digestForAnchor()`. The new digest is the same as `digestForIndex()` except when the original output was "." or "_". In those cases, a replacement character is selected based on entropy accumulated by the digest function as it iterates through the string. Test Plan: Added unit tests. Reviewers: amckinley Reviewed By: amckinley Maniphest Tasks: T13045 Differential Revision: https://secure.phabricator.com/D18909	2018-01-23 13:42:08 -08:00
epriestley	5a8a56f414	Prepare the new AuthPassword infrastructure for storing account passwords Summary: Ref T13043. In D18898 I moved VCS passwords to the new shared infrastructure. Before account passwords can move, we need to make two changes: - For legacy reasons, VCS passwords and Account passwords have different "digest" algorithms. Both are more complicated than they should be, but we can't easily fix it without breaking existing passwords. Add a `PasswordHashInterface` so that objects which can have passwords hashes can implement custom digest logic for each password type. - Account passwords have a dedicated external salt (`PhabricatorUser->passwordSalt`). This is a generally reasonable thing to support (since not all hashers are self-salting) and we need to keep it around so existing passwords still work. Add salt support to `AuthPassword` and make it generate/regenerate when passwords are updated. Then add a nice story about password digestion. Test Plan: Ran migrations. Used an existing VCS password; changed VCS password. Tried to use a revoked password. Unit tests still pass. Grepped for callers to legacy `PhabricatorHash::digestPassword()`, found none. Reviewers: amckinley Reviewed By: amckinley Maniphest Tasks: T13043 Differential Revision: https://secure.phabricator.com/D18900	2018-01-23 10:57:40 -08:00
epriestley	7970cf0585	Add a `bin/auth revoke` revoker for temporary tokens Summary: Ref T13043. Allows CLI revocation of temporary ("forgot password", "one-time login") tokens. Test Plan: Used "Forgot Password?" to generate tokens, used `bin/auth revoke --type temporary` with `--from` and `--everywhere` to revoke them. Reviewers: amckinley Reviewed By: amckinley Maniphest Tasks: T13043 Differential Revision: https://secure.phabricator.com/D18891	2018-01-22 12:00:33 -08:00
epriestley	d9b6513a21	Respect tokenizer limits in the bulk editor Summary: Ref T13025. This makes limits (for fields like "Assign To") work in the bulk editor, so you can't type "Assign to: x, y, z" anymore. Test Plan: Hit limit for "Assign to" and a custom project field. No limit for "Add subscribers". Reviewers: amckinley Reviewed By: amckinley Maniphest Tasks: T13025 Differential Revision: https://secure.phabricator.com/D18888	2018-01-22 11:55:55 -08:00
epriestley	fbfcc37531	Respect token limits for "Assign to" and custom datasource fields in Herald Summary: See PHI173. Currently, Herald has an "Assign to" action for tasks, and you can specify custom fields with datasource values (like users or projects) that have a limit (like 1 "Owner", or 12 "Jury Members"). Herald doesn't support these limits right now, so you can write `[ Assign to ][ X, Y, Z ]`. This just means "Assign to X", but make it more clear by actually enforcing the limit in the UI. Test Plan: - Created a "projects" custom field with limit 1. - Tried to create actions that 'assign to' or 'set custom field to' more than one thing, got helpfully rebuffed by the UI. - Created an "add subscribers" action with more than one value. Reviewers: amckinley Reviewed By: amckinley Differential Revision: https://secure.phabricator.com/D18887	2018-01-22 11:54:12 -08:00
epriestley	3038d564a6	Allow bulk edits to be made silently if you have CLI access Summary: Fixes T13042. This hooks up the new "silent" mode from D18882 and makes it actually work. The UI (where we tell you to go run some command and then reload the page) is pretty clumsy, but should solve some problems for now and can be cleaned up eventually. The actual mechanics (timeline aggregation, Herald interaction, etc.) are on firmer ground. Test Plan: - Made a normal bulk edit, got mail and feed stories. - Made a silent bulk edit, no mail and no feed. - Saw "Silent Edit" marker in timeline for silent edits: {F5386245} Reviewers: amckinley Reviewed By: amckinley Maniphest Tasks: T13042 Differential Revision: https://secure.phabricator.com/D18883	2018-01-19 13:24:54 -08:00
epriestley	7a43181337	Organize bulk edit actions into nice groups Summary: Ref T13025. We're getting kind of a lot of actions, so put them in nice groups so they're easier to work with. Test Plan: {F5386038} Reviewers: amckinley Reviewed By: amckinley Maniphest Tasks: T13025 Differential Revision: https://secure.phabricator.com/D18880	2018-01-19 13:22:25 -08:00
epriestley	ae1b07bcfb	Support "<select />" custom fields in bulk editor Summary: Ref T13025. Fixes T5689. A straightforward change! Test Plan: Used the bulk editor to modify a custom "select" field like the one in T5689. Reviewers: amckinley Reviewed By: amckinley Maniphest Tasks: T13025, T5689 Differential Revision: https://secure.phabricator.com/D18879	2018-01-19 13:18:02 -08:00
epriestley	b6737554e1	Support tokenizer custom fields in bulk editor Summary: Ref T13025. This allows custom tokenizer fields, like a "Owning Group" field, to be edited with the bulk editor. See PHI173 for some context. Test Plan: Edited a custom "Owner" field (a project tokenizer) with the bulk editor. Reviewers: amckinley Reviewed By: amckinley Maniphest Tasks: T13025 Differential Revision: https://secure.phabricator.com/D18877	2018-01-19 13:16:46 -08:00
epriestley	a26cf20dd1	Fix a bug with setting custom PHID list field values via Conduit and prepare for bulk edits Summary: Ref T13025. Custom field transactions work somewhat unusually: the values sometimes need to be encoded. We currently do not apply this encoding correctly via Conduit. For example, setting some custom PHID field to `["PHID-X-Y"]` fails with a bunch of JSON errors. Add an extra hook callback so that EditTypes can apply processing to transaction values, then apply the correct CustomField processing. This only affects Conduit. In a future diff, this also allows bulk edit of custom fields to work correctly. Test Plan: Added a custom field to Maniphest with a list of projects. Used Conduit to bulk edit it (which now works, but did not before). Used the web UI to bulk edit it. Reviewers: amckinley Reviewed By: amckinley Maniphest Tasks: T13025 Differential Revision: https://secure.phabricator.com/D18876	2018-01-19 12:51:35 -08:00
epriestley	8dccf05c4c	Manually set "max_allowed_packet" to 1GB for "mysqldump" Summary: We have one production instance with failing database backups since they recently uploaded a 52MB hunk. The production configuration specifies a 64MB "max_allowed_packet" in `[mysqld]`, but this doesn't apply to `mysqldump` (we'd need to specify it in a separate `[mysqldump]` section) and `mysqldump` runs with an effective limit of the default (16MB). We could change our production config to specify a value in `[mysqldump]`, but just change it unconditionally at execution time since there's no reason for any user to ever want this command to fail because they have too much data. Test Plan: Dumped locally, will verify production backup goes through cleanly. Reviewers: amckinley Reviewed By: amckinley Differential Revision: https://secure.phabricator.com/D18834	2017-12-20 10:29:02 -08:00
epriestley	c7d6fd198c	Support "Set X to" as an action in Herald for tokenizer/datasource custom fields Summary: See PHI173. Adds custom field support for Herald actions, and implements actions for "Datasource/Tokenizer" fields. The only action available for now is "set field to...". Other actions ("Add values", "Remove values") might make sense in the future for these fields, but there's currently no use case. For most other field types (text, select, checkbox, etc) only "Set to" makes sense. Test Plan: - Added a "datasource" custom field to the custom field definition in Config. - Added a "if field is empty, set field to default value X" rule to Herald. - Created a task with a nonempty field: no Herald trigger. - Created a task with an empty field: Herald fired. - Reviewed rule and transcripts for text strings. {F5297615} {F5297616} {F5297617} Reviewers: amckinley Reviewed By: amckinley Differential Revision: https://secure.phabricator.com/D18784	2017-11-28 13:41:52 -08:00
epriestley	1d213dc1fa	Clean up virtual "_ft_rank" column for query construction of Ferret objects Summary: Ref T12974. Ferret object queries SELECT a virtual "_ft_rank" column for relevance ordering. Currently, they always SELECT this column. That's fine and doesn't hurt anything, but makes developing and debugging things kind of a pain since every query has this `, blah blah _ft_rank` junk. Instead, construct this column only if we're actually going to use it. Mostly, this cleans up DarkConsole / query logs a bit. Test Plan: Viewed normal query results on various pages, viewed global search results, ordered Maniphest tasks by normal stuff and by "Relevance". Viewed DarkConsole, saw no more "_ft_rank" junk on normal pages. Reviewers: amckinley Reviewed By: amckinley Maniphest Tasks: T12974 Differential Revision: https://secure.phabricator.com/D18728	2017-10-23 16:18:04 -07:00
epriestley	157f47cd14	Rewrite CommitQuery to use UNION for performance Summary: Ref T12680. See PHI167. See that task for discussion. Rewrite `DiffusionCommitQuery` to work more like `DifferentialRevisionQuery`, and use a UNION to find "all revisions you need to audit OR respond to". I tried to get this working a little more cleanly than RevisionQuery does, and can probably simplify that now. Test Plan: Poked at the UI locally without hitting any apparent issues, but my local data is pretty garbage at this point. I'll take a look at how the query plans work on `secure`. Reviewers: amckinley Reviewed By: amckinley Maniphest Tasks: T12680 Differential Revision: https://secure.phabricator.com/D18722	2017-10-23 10:32:24 -07:00
epriestley	c5e8de9450	Make `bin/storage dump` insert CREATE DATABASE and USE statements Summary: Ref T13000. The new approach for dumping database-by-database means that we don't get CREATE DATABASE or USE statements, which makes importing the dump again inconvenient. Manually stitch these into the dump. Test Plan: - Used `bin/storage dump --namespace ...` to dump a smaller local instance. - Used `bin/storage destroy --namespace ...`, to destroy the namespace, then inported the dump cleanly. - Verified that each CREATE DATABASE statement appears only once. - Verified that `bin/storage renamespace --live` can correctly process this file. Reviewers: amckinley Reviewed By: amckinley Maniphest Tasks: T13000 Differential Revision: https://secure.phabricator.com/D18707	2017-10-13 14:35:18 -07:00
Dmitri Iouchtchenko	9bd6a37055	Fix spelling Summary: Noticed a couple of typos in the docs, and then things got out of hand. Test Plan: - Stared at the words until my eyes watered and the letters began to swim on the screen. - Consulted a dictionary. Reviewers: #blessed_reviewers, epriestley Reviewed By: #blessed_reviewers, epriestley Subscribers: epriestley, yelirekim, PHID-OPKG-gm6ozazyms6q6i22gyam Differential Revision: https://secure.phabricator.com/D18693	2017-10-09 10:48:04 -07:00
epriestley	4fd9d2d4bb	Fix "bin/storage dump" with no "--output" Ref T13004. (I distinctly remember testing this, but must have tweaked things afterward.)	2017-10-07 13:23:18 -07:00
epriestley	1ee7b3ab8c	Correct "bin/storage dump" command construction with passwords Fixes T13004. This should mirror the other branch.	2017-10-07 04:59:29 -07:00
epriestley	c767c971ca	Add "persistence" types (data, cache, or index) to tables, and tweak what "storage dump" dumps Summary: Ref T13000. This marks each table as either "data" (normal data), "cache" (automatically rebuilt, no need to ever dump) or "index" (can be manually rebuilt). By default, `bin/storage dump` dumps data and index tables, but not cache tables. With `--no-indexes`, it dumps only data tables. Indexes can be rebuilt after a restore with `bin/search index --all ...`. Test Plan: - Ran `--no-indexes` and normal dumps with `--trace`, verified that cache and index (former case) or cache only (latter case) tables were dumped with `--no-data`. - Verified dump has the same number of `CREATE TABLE` statements as before the changes. - Reviewed persistence tags in the web UI (note Ferret engine tables are "Index"): {F5210886} Reviewers: amckinley Reviewed By: amckinley Maniphest Tasks: T13000 Differential Revision: https://secure.phabricator.com/D18682	2017-10-04 12:09:33 -07:00
epriestley	02e1440ef2	Dump tables one at a time, rather than all at once Summary: Ref T13000. This allows us to be more selective about which tables we dump data for, to reduce the size of backups and exports. The immediate goal is to make large `ngrams` tables more manageable in the cluster, but this generally makes all backups and exports faster and easier. Here, tables are dumped one at a time. A followup change will sometimes add the `--no-data` flag, to skip dumping readthrough caches and (optionally) rebuildable indexes. Test Plan: Compared a dump from `master` and from this branch, found them to be essentially identical. The new dump has a little more header information in each section. Verified each contains the same number of `CREATE TABLE` statements. Reviewers: amckinley Reviewed By: amckinley Maniphest Tasks: T13000 Differential Revision: https://secure.phabricator.com/D18679	2017-10-04 12:08:52 -07:00
epriestley	0ea5d668d1	Enable hovercards for the "Task Graph" UI in Maniphest Summary: See PHI118. Enables hovercards to support peeking at tags and other details if you, e.g., create numerous identical subtasks of each task. Test Plan: {F5210816} Reviewers: amckinley Reviewed By: amckinley Differential Revision: https://secure.phabricator.com/D18681	2017-10-04 11:12:01 -07:00
epriestley	1de130c9f5	Allow the Ferret engine to remove "common" ngrams from the index Summary: Ref T13000. This adds support for tracking "common" ngrams, which occur in too many documents to be useful as part of the ngram index. If an ngram is listed in the "common" table, it won't be written when indexing documents, or queried for when searching for them. In this change, nothing actually writes to the "common" table. I'll start writing to the table in a followup change. Specifically, I plan to do this: - A new GC process updates the "common" table periodically, by writing ngrams which appear in more than X% of documents to it, for some value of X, if there are at least a minimum number of documents (maybe like 4,000). - A new GC process deletes ngrams that have been added to the common table from the existing indexes. Hopefully, this will pare down the ngrams index to something reasonable over time without requiring any manual tuning. Test Plan: - Ran some queries and indexes. - Manually inserted ngrams `xxx` and `yyy` into the ngrams table, searched and indexed, saw them ignored as viable ngrams for search/index. Reviewers: amckinley Reviewed By: amckinley Maniphest Tasks: T13000 Differential Revision: https://secure.phabricator.com/D18672	2017-10-03 13:27:42 -07:00
epriestley	94ab0c9afb	Spell "Relevance" correctly Summary: Despite how I (and everyone else?) pronounce it, it is spelled with an "a". See PHI38. Test Plan: Googled both spellings. Reviewers: amckinley Reviewed By: amckinley Differential Revision: https://secure.phabricator.com/D18622	2017-09-18 09:36:55 -07:00
epriestley	fdc0d8c2f6	Fix an issue with selecting the right stemmed ngrams with Ferret engine queries Summary: Ref T12819. In D18581, I corrected one bug (ngram selection for terms) but introduced a minor new bug. We now pass `' query '` (term corpus with boundary spaces) to the stemmer, but it bails out on this since English words don't start with spaces. Trim these extra boundary spaces off before invoking the stemmer. The practical effect of this is that searching for non-stem variations of a word ("detection") now finds stemmed variations again ("detect"). Prior to fixing this bug, the stem could find longer variations but not the other way around. Test Plan: Searched for "detection", found results matching "detect" after patch (and saw same results for "detect" and "detection"). Reviewers: chad Reviewed By: chad Maniphest Tasks: T12819 Differential Revision: https://secure.phabricator.com/D18593	2017-09-12 12:13:42 -07:00
epriestley	39b74572e6	Return fulltext tokens from the Ferret fulltext engine Summary: Ref T12819. These render the little "Searched For: X, Y, U V" hint about how something was parsed. (This might get a "substring" color or "title only" color or something in the future.) Test Plan: {F5178807} Reviewers: chad Reviewed By: chad Maniphest Tasks: T12819 Differential Revision: https://secure.phabricator.com/D18589	2017-09-11 18:04:56 -07:00
epriestley	c662dda0f1	When selecting Ferret ngrams, select term ngrams (not raw ngrams) for term search Summary: Ref T12819. For queries like `v0.2`, we would incorrectly search for ngrams including `0.2`, but this is only a substring ngram: the term corpus splits this into `v0` and `2`, so `0.2` is not in the ngrams table. When executing term queries, search for term ngrams instead. This makes "v0.2" work properly again. Test Plan: Searched for "v0.2", found a task with "v0.2" in the title. Reviewers: chad Reviewed By: chad Maniphest Tasks: T12819 Differential Revision: https://secure.phabricator.com/D18581	2017-09-08 09:47:58 -07:00
epriestley	4cae4a3b76	Correct `bin/storage analyze` internal API for cluster environments Summary: Ref T12819. This worked right in a non-cluster environment, but `bin/storage upgrade` iterates over each master in a partitioned cluster environment. Tweak the API so `bin/storage analyze` targets a single host but `bin/storage upgrade` can hit all the masters. Test Plan: Will run `bin/storage upgrade` in production again. Ran `upgrade` and `analyze` locally, still work fine. Reviewers: chad Reviewed By: chad Maniphest Tasks: T12819 Differential Revision: https://secure.phabricator.com/D18576	2017-09-07 16:35:26 -07:00
epriestley	8e9f049626	Provide "bin/storage analyze" and make "bin/storage upgrade" run analysis automatically Summary: Ref T12819. Normallly "ANALYZE TABLE" is like sprinkling magic pixie dust on the database and hoping it will make "good vibes" that cause it to go faster, but in at least some concrete cases with the ngrams tables there really was a key cardinality issue which ANALYZE TABLE corrected, fixing bogus query plans. Add `bin/storage analyze` to analyze all tables, and make `bin/storage upgrade` run it after adjustment if `--no-adjust` is not specified, and make `bin/storage adjust` run it always. This runs in a couple seconds and should never hurt anything, so it should be fine to sprinkle lots of pixie dust into the `bin/storage` workflow. Test Plan: Ran `bin/storage analyze`. Ran `bin/storage upgrade`, saw analyze run. Totally felt great vibes and really aligned chakras on the database. Reviewers: chad Reviewed By: chad Maniphest Tasks: T12819 Differential Revision: https://secure.phabricator.com/D18573	2017-09-07 14:44:34 -07:00
epriestley	2218caee0f	Reduce the amount of boilerplate that implementing FerretInterface requires Summary: See brief discussion in D18554. All the index tables are the same for every application (and, at this point, seem unlikely to change) and we never actually pass these objects around (they're only used internally). In some other cases (like Transactions) not every application has the same tables (for example, Differential has extra field for inline comments), and/or we pass the objects around (lots of stuff uses `$xactions` directly). However, in this case, and in Edges, we don't interact with any representation of the database state directly in much of the code, and it doesn't change from application to application. Just automatically define document, field, and ngram tables for anything which implements `FerretInterface`. This makes the query and index logic a tiny bit messier but lets us delete a ton of boilerplate classes. Test Plan: Indexed objects, searched for objects. Same results as before with much less code. Ran `bin/storage upgrade`, got a clean bill of health. Reviewers: chad Reviewed By: chad Maniphest Tasks: T12819 Differential Revision: https://secure.phabricator.com/D18559	2017-09-07 13:23:31 -07:00
epriestley	a2a2b3f7f4	Sort global fulltext results by overall relevance Summary: Ref T12819. Currently, under the Ferret engine, we query each application's index separately and then aggregate the results. At the moment, results are aggregated by type first, then by actual rank. For example, all the revisions appear first, then all the tasks. Instead, surface the internal ranking data from the underlying query and sort by it. Test Plan: Searched for "A B" with a task named "A B" and a revision named "A". Saw task first. Broadly, saw mixed task and revision order in result sets. Reviewers: chad Reviewed By: chad Maniphest Tasks: T12819 Differential Revision: https://secure.phabricator.com/D18551	2017-09-07 13:21:58 -07:00
epriestley	8059db894d	Use the Ferret engine fulltext document table to drive auxiliary fulltext constraints Summary: Ref T12819. I started trying to get individual engines to drive these constraints (e.g., `ManiphestTaskQuery` can do most of the work) but this is a big pain, especially since most engines don't support "any owner" or "no owner", and not everything has an owner, and so on and so on. Going down this path would have meant a huge pile of stub functions everywhere, I think. Instead, drive these through the main engine using the fulltext document table, which already has everything we need to apply these constraints in a uniform way. Also tweak some parts of query construction and result ordering. Test Plan: Searched for documents by author, owner, unowned, any owner, tags, subscribers, fulltext in global search. Got sensible results without any application-specific code. Reviewers: chad Reviewed By: chad Maniphest Tasks: T12819 Differential Revision: https://secure.phabricator.com/D18550	2017-09-07 13:21:42 -07:00
epriestley	395a2ed6d1	Add an "only()" edge logic constraint, meaning "only the other constraints, exactly" Summary: See PHI57. For example, a query for "ios, only()" finds tags tasked with iOS, exactly, and no other tags. I called this "only()" instead of "exact()" because we use the term/function "Exact" elsewhere with a different meaning, e.g. in Differential. Test Plan: Basic query for a tag: {F5168857} Same query with "only", finds tasks tagged with only that tag: {F5168858} Reviewers: chad Reviewed By: chad Differential Revision: https://secure.phabricator.com/D18543	2017-09-06 12:16:06 -07:00
epriestley	64b7778f32	Add support for relevance-ranking Ferret engine results Summary: Ref T12819. "Relevance" here just means "how many of your search terms are present in the title?" but that's about the best we can do anyway. Test Plan: Indexed tasks "A B", "A Z", "Z B", and "Z Z" (all with "A B" in comments). Searched for "A B". Got results ranked in the listed order, with "A B" as the most relevant hit for query "A B". Reviewers: chad Reviewed By: chad Maniphest Tasks: T12819 Differential Revision: https://secure.phabricator.com/D18539	2017-09-05 16:45:20 -07:00
epriestley	20aad35e60	Move Ferret engine "title:..." field definitions to the engine itself Summary: Ref T12819. Move these out of the core engine into the Ferret engine. In the future different applications can define different functions, like "summary:..." or whatever. This may get more formalization when I possibly do "author:" and such some time down the road. Test Plan: Searched for "title:...". Searched for "dog:...", got a useful error. Reviewers: chad Reviewed By: chad Maniphest Tasks: T12819 Differential Revision: https://secure.phabricator.com/D18536	2017-09-05 11:57:51 -07:00
epriestley	46abc11114	Reduce the number of magic strings in the Ferret implementation Summary: Ref T12819. Push more of the magic `' '` stuff into the engine and simplify calls to ngram construction. Also fixes a bug where a task with title "apple banana" and description "cherry doughnut" could match query "banana cherry" by separating separate term segments with newlines instead of spaces. Test Plan: - Indexed some objects. - Searched (term, substring, quoted terms). - Viewed index in database. Reviewers: chad Reviewed By: chad Maniphest Tasks: T12819 Differential Revision: https://secure.phabricator.com/D18534	2017-09-05 11:57:35 -07:00
epriestley	4a7593f47f	Consolidate more Ferret engine code into FerretEngine Summary: Ref T12819. Earlier I separated some ngram code into an "ngram engine" hoping to share it across the simple Ngrams stuff and the full Ferret stuff, but they actually use slightly different rules. Just pull more of this stuff into FerretEngine to reduce the number of moving pieces and the amount of code duplication. Test Plan: Searched for terms, rebuilt indexes. Reviewers: chad Reviewed By: chad Maniphest Tasks: T12819 Differential Revision: https://secure.phabricator.com/D18533	2017-09-05 11:57:18 -07:00
epriestley	577d498033	Create a virtual "core" field in the Ferret engine for "title and body together" Summary: See PHI46. The `core:` function means "find results in either the title or body, but not other auxiliary fields like comments". Test Plan: Searched for text present in the title (yes), body (yes), and comments (no) with the `core:...` prefix. Reviewers: chad Reviewed By: chad Differential Revision: https://secure.phabricator.com/D18514	2017-09-01 09:40:56 -07:00
epriestley	f4f73e0a7e	Separate fulltext engine extensions into "enrich" and "index" phases Summary: Ref T12819. Some of the extensions "enrich" the document (adding more fields or relationships), while others "index" it (insert it into some kind of index for later searching). Currently, these are all muddled under a single "index" phase. However, the Ferret extension cares about fields and relationships which other extensions may add. Split this into two phases: "enrich" adds fields and relationships so other extensions can read them later if they want. "Index" happens after the document is built and has all the fields and relationships. The specific problem this solves is that comments may not have been added to the document when the Ferret extension runs. By moving them to the "enrich" phase, the Ferret engine will be able to see and index comments. Test Plan: Ran `bin/search index ...`, grepped for `indexFulltextDocument`. Reviewers: chad Reviewed By: chad Maniphest Tasks: T12819 Differential Revision: https://secure.phabricator.com/D18513	2017-09-01 09:40:11 -07:00
epriestley	3b43a70773	Add "title:..." support to the Ferret engine Summary: Ref T12819. Adds (hacky, hard-coded) field support (for now, only for "title"). I've written this so `title:quick ferret` is the same as `title:quick title:ferret`. I think this is what users probably mean. You can do the other thing as `ferret title:quick`, or `title:quick all:ferret`. Test Plan: Searched for `title:x`, `title:"x"`, `title:~"x"`, etc. Searched for "garbage:y", got an exception since that's not a recognized function. Searched for `title:x y`, saw both do title search. Reviewers: chad Reviewed By: chad Maniphest Tasks: T12819 Differential Revision: https://secure.phabricator.com/D18503	2017-08-30 11:30:42 -07:00
epriestley	048aa36c23	Support "-term" in Ferret engine queries Summary: Ref T12819. Supports negating search terms, e.g. "apple -honeycrisp". When negating a term, we're a little more strict about what can match (that is, what can //prevent// a document from being returned) since it's easy for a user to type "apple -honeycrisp -honey -crisp -crispies -olcrispers -honeyyums" to keep refining their search, but hard/impossible to split apart an overboard term. Test Plan: - Ran `apple -smith`, `apple -"granny smith"`, etc. - Verified `phone -tact` does not exclude `phone contact`. - (In theory, `phone -~tact` would, but the parser currently doesn't support this, and I'm not champing at the bit to add support.) Reviewers: chad Reviewed By: chad Maniphest Tasks: T12819 Differential Revision: https://secure.phabricator.com/D18502	2017-08-30 11:30:24 -07:00
epriestley	df9c24e750	Provide some "term vs substring" support for the Ferret engine Summary: Ref T12819. Distinguishes between "term" queries and "substring" queries, and tries to match them correctly most of the time. For example: - `example` matches "example", obviously. - `~amp` matches "example", but `amp` does not. - `examples` matches "example" through stemming. - `"examples"` does not match "example" (quoted text does not stem). - `"an examp"` does not match "an example" (quoted text is still term text). - `~"an examp"` matches "an example" (quoted, substring-operator text uses substring search). Test Plan: Ran searches similar to the above, they seemed to do what they should. Reviewers: chad Reviewed By: chad Maniphest Tasks: T12819 Differential Revision: https://secure.phabricator.com/D18500	2017-08-30 11:30:04 -07:00
epriestley	e5a495f435	Parse raw Ferret queries into tokens before processing them Summary: Ref T12819. Depends on D18492. Instead of passing a raw query into the Query layer, parse it first. This allows the query layer to figure out which parts should be substring vs term match, and would allow the SearchEngine layer to do `author:...` eventually by picking it out before sending it to the Ferret engine. Test Plan: Ran some Ferret queries. They work like before, except that nonsense like `-+"quack"` raises an exception now. Reviewers: chad Reviewed By: chad Maniphest Tasks: T12819 Differential Revision: https://secure.phabricator.com/D18499	2017-08-30 11:29:46 -07:00
epriestley	f97157e7ed	Build a prototype fulltext engine ("Ferret") using only basic MySQL primitives Summary: Ref T12819. I gave this stuff a sweet code name because all the terms related to "fulltext" and "search" already mean 5 different things. It, uh, ferrets out documents for you? I'm building this to work a lot like the existing ngram index, which seems to work pretty well. If this sticks, it will auto-resolve the join issue (in T12443) by letting us do the entire thing locally in a JOIN and thus dodge a lot of mess. This index gets built alongside other indexes, but only shows up in the UI if you have prototypes enabled. If you do, it appears under the existing fulltext field in Maniphest. No existing functionality is affected or disrupted. NOTE: The query engine half of this is still EXTREMELY primitive, and this probably performs worse than the existing field for now. If this doesn't show obvious signs of being awful on `secure` I'll improve that in followup changes. Test Plan: Indexed my tasks, ran some simple queries, got the results I wanted, even for queries "ko", "k", "v0.1". {F5147746} Reviewers: chad Reviewed By: chad Maniphest Tasks: T12819, T12443 Differential Revision: https://secure.phabricator.com/D18484	2017-08-28 14:52:59 -07:00
epriestley	70088f7eec	Continue reducing callsites to ArcanistDifferentialRevisionStatus Summary: Ref T2543. Further consolidates status management into DifferentialRevisionStatus. One change I'm making here is internally renaming "CLOSED" to "PUBLISHED". The UI will continue to say "Closed", at least for now, but this should make the code more clear because we care about "is closed, exactly" vs "is any closed status (closed, abandoned, sometimes accepted)". This distinction is more obvious as `isClosed()` vs `isPublished()` than, e.g., `isClosedWithExactlyTheClosedStatus()` or something. I think "Published" is generally more clear, too, and more consistent with modern language (e.g., "pre-publish review" replacing "pre-commit review" to make it more clear what we mean in Git/Mercurial). I've removed the IN_PREPARATION status since this was just earlier groundwork for "Draft" and not actually used, and under the newer plan I'm trying to just abandon `ArcanistDifferentialRevisionStatus` entirely (or, at least, substantially). Test Plan: - Viewed revisions. - Viewed revision list. - Viewed revisions linked to a task in Maniphest. - Viewed revision graph of dependencies in Differential. - Grepped for `COLOR_STATUS_...` constants. - Grepped for removed method `getRevisionStatusIcon()` (no callsites). - Grepped for removed method `renderFullDescription()` (one callsite, replaced with just building a `TagView` inline). - Grepped for removed method `isClosedStatus()` (no callsites after other changes). Reviewers: chad Reviewed By: chad Maniphest Tasks: T2543 Differential Revision: https://secure.phabricator.com/D18340	2017-08-09 11:05:22 -07:00
epriestley	e9208ed3da	Fix a spelling error in worker triggers Summary: This word is not spelled properly. Test Plan: Read the word. Reviewers: chad Reviewed By: chad Differential Revision: https://secure.phabricator.com/D18250	2017-07-20 14:20:44 -07:00
epriestley	b46e2bb4cc	Convert cluster/projects config options to newer modular structure Summary: Ref T12845. Converts the cluster and project config options to the new stuff; this is mostly just shifting boilerplate around. Test Plan: Edited, deleted, and mangled these options from the web UI and CLI. Reviewers: chad, amckinley Reviewed By: amckinley Maniphest Tasks: T12845 Differential Revision: https://secure.phabricator.com/D18166	2017-06-27 12:35:54 -07:00
epriestley	988a52cf1a	Fix ambiguous URI parsing in Youtube Remarkup rule Summary: Fixes T12867. Also: - Simplify the code a little. - Stop mutating this on text/mobile -- there's no inherent value in the "youtu.be" link so I think this just changes the text the user wrote unnecessarily. Test Plan: {F5013804} Reviewers: chad Reviewed By: chad Maniphest Tasks: T12867 Differential Revision: https://secure.phabricator.com/D18149	2017-06-23 08:43:15 -07:00
epriestley	c71d9c601f	Pass all Throwables to Exception Handlers, not just Exceptions Summary: Ref T12855. PHP7 introduced "Throwables", which are sort of like super exceptions. Some errors that PHP raises at runtime have become Throwables instead of old-school errors now. The major effect this has is blank pages during development under PHP7 for certain classes of errors: they skip all the nice "show a pretty error" handlers and This isn't a compelete fix, but catches the most common classes of unexpected Throwable and sends them through the normal machinery. Principally, it shows a nice stack trace again instead of a blank page for a larger class of typos and minor mistakes. Test Plan: Before: blank page. After: {F5007979} Reviewers: chad, amckinley Reviewed By: chad Maniphest Tasks: T12855 Differential Revision: https://secure.phabricator.com/D18136	2017-06-20 05:44:51 -07:00
epriestley	3be36783b3	Consider inline comments with draft checkmarks as "unsubmitted" Summary: Ref T12733. When a revision has unsubmitted checkmarks: - Color the banner yellow. - Show them in the "X unsubmitted" count. - Make the "X unsubmitted" button cycle between all drafts (written but unpublished comments) and "draft done" (checked but unsubmitted "Done" checkbox comments). Test Plan: - Checked a "Done" box, saw "1 unsubmitted" and yellow banner. - Clicked "5 unsubmitted" repeatedly, saw it cycle through all unsubmitted comments and checkboxes. Reviewers: chad Reviewed By: chad Maniphest Tasks: T12733 Differential Revision: https://secure.phabricator.com/D18127	2017-06-15 05:22:58 -07:00
epriestley	887bd2d66e	In the UI, rename "Hide Inline" to "Collapse Inline" Summary: Ref T12733. This paves the way for a separate "hide" operation which completely hides things. (I didn't extend this to the server side because that would require schema changes and the new "hide" state is client-only.) Test Plan: Collapsed and expanded inlines, viewed tooltips. Reviewers: chad Reviewed By: chad Maniphest Tasks: T12733 Differential Revision: https://secure.phabricator.com/D18126	2017-06-15 05:22:44 -07:00
Chad Little	21d16c7236	Fix cancel button on inline comment view Summary: Switch over to PHUIButtonView Test Plan: Cancel, Edit, Submit new inline diff comment. Reviewers: epriestley Reviewed By: epriestley Subscribers: Korvin Differential Revision: https://secure.phabricator.com/D18120	2017-06-13 13:41:10 -07:00
Chad Little	283a95d2aa	Build a page for viewing all inline comments Summary: Adds a very basic list of all inline comments, threaded, and their status. Kept this a little simpler than the mock, mostly because sorting here feels a little strange given threads would be all over the place. Not sure sorted is needed in practice anyways. I'd probably lean towards just adding a JS checkbox to hide certain rows if needed in the future. Test Plan: Test various commenting structures: - Leave Comment - Update Diff - Leave new comment - Reply to comment - Reply to comment as revision author - Mark items as done - Update diff again {F4996915} Reviewers: epriestley Reviewed By: epriestley Subscribers: Korvin, epriestley Differential Revision: https://secure.phabricator.com/D18112	2017-06-12 11:31:20 -07:00
epriestley	3400f24c8b	Send permanent dameon failures to the log, even when not running in verbose mode Summary: Fixes T12803. An install is having difficulty diagnosing mail failures, and one component is that permanent task failures aren't reaching the log. It's reasonable to send these to the log even when "phd.verbose" is off. See T12803 for a rough review of when we generate these failrues today. Test Plan: - Faked some exceptions. - Got a result in the log (P2058) with `phd.verbose` turned off. Reviewers: chad, amckinley Reviewed By: chad Maniphest Tasks: T12803 Differential Revision: https://secure.phabricator.com/D18106	2017-06-08 15:26:19 -07:00
epriestley	8692d673c8	Fix minor inline comment header button behaviors Summary: Fixes T12806. Ref T12733. - Don't count synthetic (lint) comments as anything. - When you begin writing an inline then cancel it, don't count it as anything. - When we would show "0 / X", just show "X". Test Plan: - Viewed a diff with synthetic comments, no button. - Wrote, then cancelled an inline. No "X comments". - Clicked / unlicked "Done", saw "X" -> "1 / X". Reviewers: chad Reviewed By: chad Maniphest Tasks: T12806, T12733 Differential Revision: https://secure.phabricator.com/D18103	2017-06-07 19:10:12 -07:00
epriestley	709c304d76	Group query results under the "ANCESTOR" operator unconditionally Summary: Fixes T12753. See that task for reproduction instructions. We add a `GROUP BY` clause to queries with an "ANCESTOR" edge constraint only if the constaint has more than one PHID, but this is incorrect: the same row can be found twice by an ANCESTOR query if task T is tagged with both "B" and "C", children of "A", and the user queries for "tasks in A". Instead, always add GROUP BY for ANCESTOR queries. Test Plan: - Followed test plan in T12753. - Saw proper paging controls after change. - Saw `GROUP BY` in DarkConsole. Reviewers: chad Reviewed By: chad Maniphest Tasks: T12753 Differential Revision: https://secure.phabricator.com/D18012	2017-05-24 13:29:25 -07:00
epriestley	20e7f7d0e2	Bump markup engine version to clear old "Navigation Sequence" elements Summary: The tag/shade stuff changed, so purge older markup (like Diviner documents). Test Plan: {F4972666} Reviewers: chad Reviewed By: chad Differential Revision: https://secure.phabricator.com/D17998	2017-05-23 16:28:05 +00:00
Chad Little	00400ae6f9	Search and Replace calls to setShade Summary: grep for setShade and update to setColor. Add deprecated warning. Test Plan: Diffusion, Workboards, Maniphest, Project tags, tokenizer, uiexamples Reviewers: epriestley Reviewed By: epriestley Subscribers: Korvin, O14 ATC Monitoring Differential Revision: https://secure.phabricator.com/D17995	2017-05-22 18:59:53 +00:00
epriestley	4dff754502	Show a snippet when hovering inlines in the objective list Summary: Ref T12733. Shows a comment snippet when hovering inlines in the objective list. Test Plan: {F4968490} Reviewers: chad Reviewed By: chad Maniphest Tasks: T12733 Differential Revision: https://secure.phabricator.com/D17980	2017-05-20 08:00:09 -07:00
epriestley	6c46f27d98	Add quest objectives to the minimap Summary: Add important objectives (like waygates and quest markers) to the minimap. This also probably fixes @cspeckmim's bug with the {key @} keyboard shortcut. Test Plan: (This is probably easier to undestand if you `arc patch` + click around.) {F4966037} Reviewers: chad, amckinley Reviewed By: chad Subscribers: cspeckmim Differential Revision: https://secure.phabricator.com/D17955	2017-05-19 12:01:01 -07:00
Joshua Spence	0ed496de22	Throw an exception if `local.json` can't be read Summary: Our `local.json` configuration file contains various secrets, including database usernames and passwords. As such, we recently changed the permissions on this file from `0644` to `0640`. After doing so, however, I constantly forget to run commands with `sudo`. This is made worse by the fact that `PhabricatorConfigLocalSource` seems to simply ignore `local.json` is it isn't readable, whereas throwing an `Exception` would have saved me a lot of debugging. Test Plan: ```name=Before > /usr/local/src/phabricator/bin/config get mysql.pass { "config": [ { "key": "mysql.pass", "source": "local", "value": null, "status": "unset", "errorInfo": null }, { "key": "mysql.pass", "source": "database", "value": null, "status": "error", "errorInfo": "Database source is not configured properly" } ] } ``` ```name=After > /usr/local/src/phabricator/bin/config get mysql.pass [2017-05-16 21:49:26] EXCEPTION: (FilesystemException) Path '/usr/local/src/phabricator/conf/local/local.json' is not readable. at [<phutil>/src/filesystem/Filesystem.php:1124] arcanist(head=stable, ref.master=3c4735795a29, ref.stable=20ad47f27331), phabricator(head=stable, ref.master=3dae9701298f, ref.stable=fcebaa5097f3), phutil(head=stable, ref.master=a900d7b63e95, ref.stable=d02cc05931b0) #0 Filesystem::assertReadable(string) called at [<phutil>/src/filesystem/Filesystem.php:39] #1 Filesystem::readFile(string) called at [<phabricator>/src/infrastructure/env/PhabricatorConfigLocalSource.php:25] #2 PhabricatorConfigLocalSource::loadConfig() called at [<phabricator>/src/infrastructure/env/PhabricatorConfigLocalSource.php:6] #3 PhabricatorConfigLocalSource::__construct() called at [<phabricator>/src/infrastructure/env/PhabricatorEnv.php:195] #4 PhabricatorEnv::buildConfigurationSourceStack(boolean) called at [<phabricator>/src/infrastructure/env/PhabricatorEnv.php:95] #5 PhabricatorEnv::initializeCommonEnvironment(boolean) called at [<phabricator>/src/infrastructure/env/PhabricatorEnv.php:75] #6 PhabricatorEnv::initializeScriptEnvironment(boolean) called at [<phabricator>/scripts/init/lib.php:22] #7 init_phabricator_script(array) called at [<phabricator>/scripts/init/init-setup.php:11] #8 require_once(string) called at [<phabricator>/scripts/setup/manage_config.php:5] ``` Reviewers: #blessed_reviewers, joshuaspence Reviewed By: joshuaspence Subscribers: Korvin Differential Revision: https://secure.phabricator.com/D17917	2017-05-16 15:12:49 -07:00
epriestley	6a9dd61c42	Make collapsed inlines more useful and anchor target highlights more accurate Summary: Ref T12616. Fixes T11648. Currently, we snug up replies with a negative margin (from T10563) but this throws off the anchor highlighting. Instead: - Remove padding from these dolumns. - Use margins on the stuff inside them instead. - Less margins for replies. - Less margins for collapsed comments. - Show some text for collapsed comments. Test Plan: {F4960890} {F4960891} Reviewers: chad Reviewed By: chad Maniphest Tasks: T12616, T11648 Differential Revision: https://secure.phabricator.com/D17913	2017-05-16 11:09:53 -07:00
epriestley	86b9deb8a9	Move inline anchors up, to dolumn-level Summary: Fixes T8420. Now that hidden inlines no longer fold into a big clump, anchors can just jump to them in a normal way. Move the anchors up a smidge so thing work. Test Plan: Clicked an anchor pointed at a hidden inline, ended up in the right place. Reviewers: chad Reviewed By: chad Maniphest Tasks: T8420 Differential Revision: https://secure.phabricator.com/D17910	2017-05-16 10:11:57 -07:00
epriestley	8052ab84bf	Remove "^" (Prev) and "V" (Next) actions on Differential inline comments Summary: Ref T12616. Fixes T12715. I suspect these are very rarely used. (I think you tried to get rid of them before but I pushed back since we couldn't really offer great alternatives at the time?) Now that the code is in a better place: - Click an inline's header (just the colored part) to select it with the keyboard selection cursor. - Click again to deselect it. - You can use "n" and "p" to jump to comments, so "click + n" is the same as the old "V" action. - This also makes it easier to swap between keyboard and mouse workflows, since you can jump into things with the keyboard at any inline. Also, make "Reply" render more consistently. Test Plan: - Did all that stuff, things seemed to work OK. Reviewers: chad Reviewed By: chad Maniphest Tasks: T12715, T12616 Differential Revision: https://secure.phabricator.com/D17908	2017-05-16 09:44:00 -07:00
epriestley	41379f39de	Move inline replies to new code and remove DifferentialInlineEditor Summary: Ref T12616. This moves "reply" to the new stuff and deletes DifferentialInlineEditor, which no longer does anything. (This breaks some keyboard shortcuts, but I'll rebase D17859 shortly.) Test Plan: Replied to inlines; things seemed to work properly. Reviewers: chad Reviewed By: chad Maniphest Tasks: T12616 Differential Revision: https://secure.phabricator.com/D17894	2017-05-16 06:23:51 -07:00
epriestley	3c18cb77fb	Move inline "done" checkboxing to DiffInline Summary: Ref T12616. This updates clicking the "Done" checkbox for the new stuff. This one is pretty clean since the "Done" checkbox doesn't do too much weird magic. Test Plan: Clicked the box a few times. Reviewers: chad Reviewed By: chad Maniphest Tasks: T12616 Differential Revision: https://secure.phabricator.com/D17888	2017-05-16 06:21:00 -07:00
epriestley	4fd4ec3d27	Hide inlines one-by-one, instead of in a big group Summary: Ref T12616. Fixes T12153. Currently, when you hide inlines, they hide completely and turn into a little bubble on the previous line. Instead, collapse them to a single line one-by-one. Narrowly, this fixes T12153. In the future, I plan to make these changes so this feature makes more sense: - Introduce global "hide everything" states (T8909) so you can completely hide stuff if you want, and this represents more of a halfway state between "nuke it" and "view it". - Make the actual rendering better, so it says "epriestley: blah blah..." instead of just "..." -- and looks less dumb. The real goal here is to introduce `DiffInline` and continue moving stuff from the tangled jungle of a million top-level behaviors to sensible smooth statefulness. Test Plan: - Hid and revealed inlines in unified and two-up modes. - These look pretty junk for now: {F4948659} Reviewers: chad Reviewed By: chad Maniphest Tasks: T12616, T12153 Differential Revision: https://secure.phabricator.com/D17861	2017-05-16 06:19:56 -07:00
epriestley	63450cc48e	Remove "Show All Context" button from Diffusion Summary: Ref T12616. Diffusion, only, has a "Show All Context" button which expands the full context on all changes. I don't remember the exact history on this, but it hasn't existed in Differential for some time and no one has complained. I suspect that the "View Options > Show All Context" on each file may replace it. I can't really come up with good reasons to use it, offhand. If we want to restore it, I think global options after T1591 is promising. {F4945561} Test Plan: - Loaded a commit in Diffusion, no longer saw a button. - Grepped for relevant sigils. Reviewers: chad Reviewed By: chad Maniphest Tasks: T12616 Differential Revision: https://secure.phabricator.com/D17843	2017-05-16 06:17:52 -07:00
epriestley	bcd87e0e3f	Don't apply patches or mark patches applied with `bin/storage upgrade --dryrun` Summary: Fixes T12682. Test Plan: Ran `bin/storage upgrade --dryrun` repeatedly with un-applied patches, saw it not apply them and not mark them applied. Reviewers: chad Reviewed By: chad Maniphest Tasks: T12682 Differential Revision: https://secure.phabricator.com/D17837	2017-05-05 19:57:18 -07:00
epriestley	85ff1d5c2d	Reduce the impact of `bin/storage dump` Summary: Ref T12646. - Use "wb1" instead of "wb" to use level 1 gzip compression (faster, less compressy). Locally, this went about 2x faster and the output only grew 4% larger. - LinesOfALargeExecFuture does a lot of unnecessary string operations, and can boil down to a busy wait. The process is pretty saturated by I/O so this isn't the end of the world, but just use raw ExecFuture with FutureIterator so that we wait in `select()`. - Also, nice the process to +19 so we try to give other things CPU. Test Plan: - Ran `bin/storage dump --compress --output ...`. - Saw CPU time for my local database drop from ~240s to ~90s, with a 4% larger output. Most of this was adding the `1`, but the ExecFuture thing helped a little, too. - I'm not sure what a great way to test `nice` in a local environment is and it's system dependent anyway, but nothing got worse / blew up. - Used `gzcat \| head` and `gzcat \| tail` on the result to sanity-check that everything was preserved. Reviewers: chad, amckinley Reviewed By: chad Maniphest Tasks: T12646 Differential Revision: https://secure.phabricator.com/D17795	2017-04-26 12:08:59 -07:00
epriestley	d0e6bf831d	Add "%I" (instance name) to application log formats Summary: Ref T12611. Currently, the HTTP/SSH logs don't have an option to include the instance name. Add such an option. Leave it out of the default logs because most installs don't use this. Test Plan: See next changes. Reviewers: chad, amckinley Reviewed By: chad Maniphest Tasks: T12611 Differential Revision: https://secure.phabricator.com/D17776	2017-04-23 11:07:19 -07:00
epriestley	5c1e4488de	Remove all "Phabricator Bot" code Summary: Closes T7829 as wontfix. Closes T7965 as wontfix. Closes T7800 as wontfix. Closes T2731 as wontfix. Closes T1271 as wontfix. We aren't maintaining this at all (see, e.g., T7829) and a user reported a technically accurate security issue via HackerOne: <https://hackerone.com/reports/222870> Just throw it away until we get to the eventual Conphernece bot/API update and can do this stuff correctly. Test Plan: Grepped for `phabricatorbot`. Reviewers: chad Reviewed By: chad Maniphest Tasks: T7965, T7829, T7800, T2731, T1271 Differential Revision: https://secure.phabricator.com/D17756	2017-04-21 12:48:35 -07:00
Austin McKinley	febd68039f	Add initial infrastructure for adding ModularTransaction support to Application config changes Summary: Part of the groundwork for T11476. Test Plan: ran `./bin/storage upgrade` and observed expected DB tables Reviewers: epriestley Reviewed By: epriestley Subscribers: Korvin Maniphest Tasks: T11476 Differential Revision: https://secure.phabricator.com/D17736	2017-04-19 15:44:57 -07:00
epriestley	3245e74f16	Show users how fulltext search queries are parsed and executed; don't query stopwords or short tokens Summary: Depends on D17670. Fixes T12137. Fixes T12003. Ref T2632. This shows users a readout of which terms were actually searched for. This also drops those terms from the query we submit to the backend, dodging the weird behaviors / search engine bugs in T12137. This might need some design tweaking. Test Plan: {F4899825} Reviewers: chad Reviewed By: chad Maniphest Tasks: T12137, T12003, T2632 Differential Revision: https://secure.phabricator.com/D17672	2017-04-12 19:07:54 -07:00
epriestley	cb49acc2ca	Update Phabricator to use intermediate tokens from the query compiler Summary: Depends on D17669. Ref T12137. Ref T12003. Ref T2632. Ref T7860. Converts Phabricator to the new parse + compile workflow with intermediate tokens. Also fixes a bug where searches for `cat"` or similar (unmatched quotes) wouldn't produce a nice exception. Test Plan: - Fulltext searched. - Fulltext searched in Conpherence. - Fulltext searched with bad syntax. Reviewers: chad Reviewed By: chad Maniphest Tasks: T12137, T12003, T7860, T2632 Differential Revision: https://secure.phabricator.com/D17670	2017-04-12 19:07:33 -07:00
Chad Little	5dd18a7ec1	Modernize PhortuneAccount with EditEngine/Modular Transactions Summary: This updates the backend of PhortuneAccount to use EditEngine and Modular Transactions and updates language to "account manager" for clarity of role. Test Plan: - Wiped `phortune_account` table - Visit Phortune, see new account automatically created. - Edit name and managers - Try to set no name or remove myself as a manager, get error messages - Visit `/phortune/` and create another new account Reviewers: epriestley Reviewed By: epriestley Subscribers: Korvin Differential Revision: https://secure.phabricator.com/D17585	2017-04-11 12:33:15 -07:00
Chad Little	28941b3105	Update PhortuneMerchant to Modular Transactions Summary: Modernize PhortuneMerchant for Modular Transactions. Also changed the language of "Members" to "Managers", which I think fits better given the power/capability. Test Plan: - Create a new Merchant - Test not filling in a name, see error - Test removing myself, see error - Edit an existing Merchant - Add new managers - Test removing myself, see error - Replace Picture - Update various fields, contact info, email, footer - Verify transactions are now nice and pretty Reviewers: epriestley Reviewed By: epriestley Subscribers: Korvin Differential Revision: https://secure.phabricator.com/D17584	2017-04-11 09:32:12 -07:00
epriestley	d1421bc3a1	Add "bin/storage optimize" to run OPTIMIZE TABLE on everything Summary: Even with `innodb_file_per_table` enabled, individual table files on disk don't normally shrink. For most tables, like `maniphest_task`, this is fine, since the data in the table normally never shrinks, or only shinks a tiny amount. However, some tables (like the "worker" and "daemon" tables) grow very large during a huge import but most of the data is later deleted by garbage collection. In these cases, this lost space can be reclaimed by running `OPTIMIZE TABLE` on the tables. Add a script to `OPTIMIZE TABLE` every table. My primary goal here is just to reduce storage pressure on `db001` since there are a couple of "import the linux kernel" installs on that host wasting a bunch of space. We're not in any trouble, but this should buy us a good chunk of headroom. Test Plan: Ran `bin/storage optimize` locally and manually ran `OPTIMIZE TABLE` in production, saw tables get optimized. Reviewers: chad Reviewed By: chad Subscribers: cspeckmim Differential Revision: https://secure.phabricator.com/D17640	2017-04-08 15:15:49 -07:00
epriestley	3d816e94df	Rename "PhabricatorHash::digest()" to "weakDigest()" Summary: Ref T12509. This encourages code to move away from HMAC+SHA1 by making the method name more obviously undesirable. Test Plan: `grep`, browsed around. Reviewers: chad Reviewed By: chad Maniphest Tasks: T12509 Differential Revision: https://secure.phabricator.com/D17632	2017-04-06 15:43:33 -07:00
epriestley	3a3626834e	Replace Remarkup calls to `PhabricatorHash::digest()` with SHA256 Summary: Ref T12509. Many of the calls to HMAC+SHA1 are just to compute cachekeys for remarkup objects. Make these use HMAC+SHA256 instead. There is no downside to swapping these since they just cause a cache miss in the worst case. I also plan to get rid of `PhabricatorMarkupInterface` eventually, but this doesn't go that far. Test Plan: Browsed some different types of documents (tasks, legalpad documents, phame blogs / posts, pholio mocks, etc). Reviewers: chad Reviewed By: chad Maniphest Tasks: T12509 Differential Revision: https://secure.phabricator.com/D17631	2017-04-06 15:43:18 -07:00
epriestley	d450a08890	Support HMAC+SHA256 with automatic key generation and management Summary: Ref T12509. This adds support for HMAC+SHA256 (instead of HMAC+SHA1). Although HMAC+SHA1 is not currently broken in any sense, SHA1 has a well-known collision and it's good to look at moving away from HMAC+SHA1. The new mechanism also automatically generates and stores HMAC keys. Currently, HMAC keys largely use a per-install constant defined in `security.hmac-key`. In theory this can be changed, but in practice essentially no install changes it. We generally (in fact, always, I think?) don't use HMAC digests in a way where it matters that this key is well-known, but it's slightly better if this key is unique per class of use cases. Principally, if use cases have unique HMAC keys they are generally less vulnerable to precomputation attacks where an attacker might generate a large number of HMAC hashes of well-known values and use them in a nefarious way. The actual threat here is probably close to nonexistent, but we can harden against it without much extra effort. Beyond that, this isn't something users should really have to think about or bother configuring. Test Plan: - Added unit tests. - Used `bin/files integrity` to verify, strip, and recompute hashes. - Tampered with a generated HMAC key, verified it invalidated hashes. Reviewers: chad Reviewed By: chad Maniphest Tasks: T12509 Differential Revision: https://secure.phabricator.com/D17630	2017-04-06 15:42:59 -07:00
epriestley	08a4225437	Provide "bin/files integrity" for debugging, maintaining and backfilling integrity hashes Summary: Ref T12470. Provides an "integrity" utility which runs in these modes: - Verify: check that hashes match. - Compute: backfill missing hashes. - Strip: remove hashes. Useful for upgrading across a hash change. - Corrupt: intentionally corrupt hashes. Useful for debugging. - Overwrite: force hash recomputation. Users normally shouldn't need to run any of this stuff, but this provides a reasonable toolkit for managing integrity hashes. I'll recommend existing installs use `bin/files integrity --compute all` in the upgrade guidance to backfill hashes for existing files. Test Plan: - Ran the script in many modes against various files, saw expected operation, including: - Verified a file, corrupted it, saw it fail. - Verified a file, stripped it, saw it have no hash. - Stripped a file, computed it, got a clean verify. - Stripped a file, overwrote it, got a clean verify. - Corrupted a file, overwrote it, got a clean verify. - Overwrote a file, overwrote again, got a no-op. Reviewers: chad Reviewed By: chad Maniphest Tasks: T12470 Differential Revision: https://secure.phabricator.com/D17629	2017-04-06 15:42:43 -07:00
epriestley	7e6f37fffb	Rename "ElasticSearch" filenames to "Elasticsearch" (2/2) Sometimes git does some odd magic on case-insensitive filesystems, try to trick it. Auditors: chad	2017-04-02 14:59:36 -07:00
epriestley	a9e2732a5c	Spell "Elasticsearch" correctly, not "ElasticSearch" Summary: Ref T12450. These are like 95% my fault, but Elastic appears to spell the name "Elasticsearch" consistently in their branding. Test Plan: `grep ElasticSearch` Reviewers: chad, 20after4 Maniphest Tasks: T12450 Differential Revision: https://secure.phabricator.com/D17601	2017-04-02 14:58:59 -07:00
epriestley	304d19f92a	After a fulltext write to a particular service fails, keep trying writes to other services Summary: Ref T12450. Currently, if a write fails, we stop and don't try to write to other index services. There's no technical reason not to keep trying writes, it makes some testing easier, and it would improve behavior in a scenario where engines are configured as "primary" and "backup" and the primary service is having some issues. Also, make "no writable services are configured" acceptable, rather than an error. This state is probably goofy but if we want to detect it I think it should probably be a config-validation issue, not a write-time check. I also think it's not totally unreasonable to want to just turn off all writes for a while (maybe to reduce load while you're doing a background update). Test Plan: - Configured a bad ElasticSearch engine and a good MySQL engine. - Ran `bin/search index ... --force`. - Saw MySQL get updated even though ElasticSearch failed. Reviewers: chad, 20after4 Reviewed By: 20after4 Maniphest Tasks: T12450 Differential Revision: https://secure.phabricator.com/D17599	2017-04-02 13:47:52 -07:00
epriestley	515cb98819	When running unit tests, ignore any custom task fields Summary: If you have `maniphest.custom-field-definitions` set to include "required" fields, a bunch of tests which create tasks can fail. To avoid this, reset this config while running tests. This mechanism should probably be more general (e.g., reset all config by default, only whitelist some config) but just fix this for now since it's a one-liner and doesn't make eventual cleanup any harder. Test Plan: Ran `arc unit`, hitting tests that create tasks. Reviewers: chad, 20after4 Reviewed By: chad Differential Revision: https://secure.phabricator.com/D17595	2017-04-02 09:36:17 -07:00
Daniel Stone	1c5503cb29	Custom fields: Render 'required' for tokenizer fields Summary: When building a tokenizer-based edit control for a custom field (e.g. a datasource type), preserve a field validation error whilst building edit controls. Test Plan: - Create custom datasource field, set it to required - Observe that 'Required' does not appear next to control - Apply patch - Observe 'Required' appears next to control Reviewers: #blessed_reviewers, epriestley Reviewed By: #blessed_reviewers, epriestley Subscribers: epriestley Differential Revision: https://secure.phabricator.com/D17592	2017-04-02 15:26:26 +00:00
Mukunda Modell	cb1d904654	Make sure writes go to the right cluster Summary: Two little issues 1. there was an extra call to getHostForWrite, 2. The engine instance was shared between multiple service definitions so it was overwriting the list of writable hosts from one service with hosts from another. Test Plan: tested in wikimedia production with multiple services defined like this: ```language=json [ { "hosts": [ { "host": "search.svc.codfw.wmnet", "protocol": "https", "roles": { "read": true, "write": true }, "version": 5 } ], "path": "/phabricator", "port": 9243, "type": "elasticsearch" }, { "hosts": [ { "host": "search.svc.eqiad.wmnet", "protocol": "https", "roles": { "read": true, "write": true }, "version": 5 } ], "path": "/phabricator", "port": 9243, "type": "elasticsearch" } ] ``` Reviewers: #blessed_reviewers, epriestley Reviewed By: #blessed_reviewers, epriestley Subscribers: epriestley Differential Revision: https://secure.phabricator.com/D17581	2017-03-30 18:08:05 +00:00
Mukunda Modell	654f0f6043	Make messages translatable and more sensible. Summary: These exception messages & comments didn't quite match reality. Fixed and added pht() around a couple of them. Test Plan: I didn't test this :P Reviewers: epriestley, #blessed_reviewers Reviewed By: epriestley, #blessed_reviewers Subscribers: Korvin Differential Revision: https://secure.phabricator.com/D17578	2017-03-28 23:17:35 +00:00
epriestley	88798354e8	Soften a possible cluster search setup fatal Summary: Ref T12450. The way that config repair and setup issues interact is kind of complicated, and if `cluster.search` is invalid we may end up using `cluster.search` before we repair it. I poked at things for a bit but wasn't confident I could get it to consistently repair before we use it without doing a big messy change. The only thing that really matters is whether "type" is valid or not, so just put a slightly softer/more-tailored check in for that. Test Plan: - With `"type": "elastic"`, loaded setup issues. - Before patch: hard fatal. - After patch: softer fatal with more useful messaging. {F4321048} Reviewers: chad Reviewed By: chad Maniphest Tasks: T12450 Differential Revision: https://secure.phabricator.com/D17576	2017-03-28 15:28:16 -07:00
epriestley	5f939dcce0	Re-run config validation from `bin/search` Summary: Ref T12450. Normally, we validate config when: - You restart the webserver. - You edit it with `bin/config set ...`. - You edit it with the web UI. However, you can also change config by editing `local.json`, `some_env.conf.php`, a `SiteConfig` class, etc. In these cases, you may miss config warnings. Explicitly re-run search config checks from `bin/search`, similar to the additional database checks we run from `bin/storage`, to try to produce a better error message if the user has made a configuration error. Test Plan: ``` $ ./bin/search init Usage Exception: Setting "cluster.search" is misconfigured: Invalid search engine type: elastic. Valid types are: elasticsearch, mysql. ``` Reviewers: chad, 20after4 Reviewed By: 20after4 Maniphest Tasks: T12450 Differential Revision: https://secure.phabricator.com/D17574	2017-03-28 14:53:26 -07:00
epriestley	8879118b69	Fix a mid-air collision around SearchService roles My D17571 didn't interact nicely with D17564, which added callsites for one of the methods I removed. Auditors: 20after4	2017-03-28 14:01:45 -07:00
epriestley	c40be811ea	Fix isReadable() and isWritable() in SearchService Summary: Ref T12450. Minor cleanup: - setRoles() has no callers. - getRoles() has no callers (these two methods are leftovers from an earlier iteration of the change). - The `hasRole()` logic doesn't work since nothing calls `setRole()`. - `hasRole()` has only `isreadable/iswritable` as callers. - The `isReadable()/isWritable()` logic doesn't work since `hasRole()` doesn't work. Instead, just check if there are any readable/writable hosts. `Host` already inherits its config from `Service` so this gets the same answer without any fuss. Also add some read/write constants to make grepping this stuff a little easier. Test Plan: - Grepped for all removed symbols, saw only newer-generation calls in `Host`. - See next diff for use of `isWritable()`. Reviewers: chad, 20after4 Reviewed By: 20after4 Maniphest Tasks: T12450 Differential Revision: https://secure.phabricator.com/D17571	2017-03-28 13:58:46 -07:00
Mukunda Modell	699228c73b	Address some New Search Configuration Errata Summary: [ ] Write an "Upgrading: ..." guidance task with narrow instructions for installs that are upgrading. [ ] Do we need to add an indexing activity (T11932) for installs with ElasticSearch? [ ] We should more clearly detail exactly which versions of ElasticSearch are supported (for example, is ElasticSearch <2 no longer supported)? From T9893 it seems like we may //only// have supported ElasticSearch <2 before, so are the two regions of support totally nonoverlapping and all ElasticSearch users will need to upgrade? [ ] Documentation should provide stronger guidance toward MySQL and away from Elastic for the vast majority of installs, because we've historically seen users choosing Elastic when they aren't actually trying to solve any specific problem. [ ] When users search for fulltext results in Maniphest and hit too many documents, the current behavior is approximately silent failure (see T12443). D17384 has also lowered the ceiling for ElasticSearch, although previous changes lowered it for MySQL search. We should not fail silently, and ideally should build toward T12003. [ ] D17384 added a new "keywords" field, but MySQL does not search it (I think?). The behavior should be as consistent across MySQL and Elastic as we can make it. Likely cleaner is giving "Project" objects a body, with "slugs" and "description" separated by newlines? [ ] `PhabricatorSearchEngineTestCase` is now pointless and only detects local misconfigurations. [ ] It would be nice to build a practical test suite instead, where we put specific documents into the index and then search for them. The upstream test could run against MySQL, and some `bin/search test` could run against a configured engine like ElasticSearch. This would make it easier to make sure that behavior was as uniform as possible across engine implementations. [ ] Does every assigned task now match "user" in ElasticSearch? [x] `PhabricatorElasticFulltextStorageEngine` has a `json_encode()` which should be `phutil_json_encode()`. [ ] `PhabricatorSearchService` throws an untranslated exception. [ ] When a search cluster is down, we probably don't degrade with much grace (unhandled exception)? [ ] I haven't run bin/search init, but bin/search index doesn't warn me that I may want to. This might be worth adding. The UI does warn me. [ ] bin/search init warns me that the index is "incorrect". It might be more clear to distinguish between "missing" and "incorrect", since it's more comforting to users to see "everything is as we expect, doing normal first-time setup now" than "something is wrong, fixing it". [ ] CLI message "Initializing search service "ElasticSearch"" does not end with a period, which is inconsistent with other UI messages. [ ] It might be nice to let bin/search commands like init and index select a specific service (or even service + host) to act on, as bin/storage --ref ... now does. You can generally get the result you want by fiddling with config. [ ] When a service isn't writable, bin/search init reports "Search cluster has no hosts for role "write".". This is accurate but does not provide guidance: it might be more useful to the user to explain "This service is not writable, so we're skipping index check for it.". [x] Even with write off for MySQL, bin/search index --type task --trace still updates MySQL, I think? I may be misreading the trace output. But this behavior doesn't make sense if it is the actual behavior, and it seems like reindexAbstractDocument() uses "all services", not "writable services", and the MySQL engine doesn't make sure it's writable before indexing. [x] Searching or user fails to find task Grant users tokens when a mention is created, suggesting that stemming is not working. [x] Searching for users finds that task, but fails to find a task containing "per user per month" in a comment, also suggesting that stemming is not working. [x] Searching for maniphest fails to find task maniphest.query elephant, suggesting that tokenization in ElasticSearch is not as good as the MySQL tokenization for these words (see D17330). [x] The "index incorrect" warning UI uses inconsistent title case. [x] The "index incorrect" warning UI could format the command to be run more cleanly (with addCommand(), I think). refs T12450 Test Plan: * Stared blankly at the code. * Disabled 'write' role on mysql fulltext service. * Edited a task, ran search indexer, verified that the mysql index wasn't being updated. Reviewers: epriestley, #blessed_reviewers Reviewed By: epriestley, #blessed_reviewers Subscribers: Korvin Maniphest Tasks: T12450 Differential Revision: https://secure.phabricator.com/D17564	2017-03-28 20:19:38 +00:00
Mukunda Modell	e41c25de50	Support multiple fulltext search clusters with 'cluster.search' config Summary: The goal is to make fulltext search back-ends more extensible, configurable and robust. When this is finished it will be possible to have multiple search storage back-ends and potentially multiple instances of each. Individual instances can be configured with roles such as 'read', 'write' which control which hosts will receive writes to the index and which hosts will respond to queries. These two roles make it possible to have any combination of: * read-only * write-only * read-write * disabled This 'roles' mechanism is extensible to add new roles should that be needed in the future. In addition to supporting multiple elasticsearch and mysql search instances, this refactors the connection health monitoring infrastructure from PhabricatorDatabaseHealthRecord and utilizes the same system for monitoring the health of elasticsearch nodes. This will allow Wikimedia's phabricator to be redundant across data centers (mysql already is, elasticsearch should be as well). The real-world use-case I have in mind here is writing to two indexes (two elasticsearch clusters in different data centers) but reading from only one. Then toggling the 'read' property when we want to migrate to the other data center (and when we migrate from elasticsearch 2.x to 5.x) Hopefully this is useful in the upstream as well. Remaining TODO: * test cases * documentation Test Plan: (WARNING) This will most likely require the elasticsearch index to be deleted and re-created due to schema changes. Tested with elasticsearch versions 2.4 and 5.2 using the following config: ```lang=json "cluster.search": [ { "type": "elasticsearch", "hosts": [ { "host": "localhost", "roles": { "read": true, "write": true } } ], "port": 9200, "protocol": "http", "path": "/phabricator", "version": 5 }, { "type": "mysql", "roles": { "write": true } } ] Also deployed the same changes to Wikimedia's production Phabricator instance without any issues whatsoever. ``` Reviewers: epriestley, #blessed_reviewers Reviewed By: epriestley, #blessed_reviewers Subscribers: Korvin, epriestley Tags: #elasticsearch, #clusters, #wikimedia Differential Revision: https://secure.phabricator.com/D17384	2017-03-26 08:16:47 +00:00
epriestley	a41d158490	Only hibernate the Taskmaster after 15 seconds of inactivity Under some workloads, the taskmaster may hibernate and launch more rapidly than it should. Require 15 seconds of inactivity before hibernating. Also hibernate for longer. Auditors: chad	2017-03-25 05:01:32 -07:00
epriestley	2cda280cde	Make the default Trigger hibernation 3 minutes instead of 5 seconds The `min()` vs `max()` fix in D17560 meant that the Trigger daemon only hibernates for 5 seconds, so we do a full GC sweep every 5 seconds. This ends up eating a fair amount of CPU for no real benefit. The GC cursors should move to persistent storage, but just bump this default up in the meantime. Auditors: chad	2017-03-25 04:14:32 -07:00
epriestley	8b553d2f18	Allow taskmaster daemons to hibernate Summary: Ref T12298. Like PullLocal daemons, this allows the last daemon in the pool to hibernate if there's no work to be done, and awakens the pool when work arrives. Test Plan: - Ran `bin/phd debug task --trace`. - Saw the pool hibernate and look for tasks. - Commented on an object. - Saw the pool wake up and process the queue. Reviewers: chad Reviewed By: chad Maniphest Tasks: T12298 Differential Revision: https://secure.phabricator.com/D17559	2017-03-24 13:51:37 -07:00
epriestley	f13637627d	Improve daemon "waiting" message, config reload behavior Summary: Ref T12298. Two minor daemon improvements: - Make the "waiting" message reflect hibernation. - Don't trigger a reload right after launching. Test Plan: - Read "waiting" message. - Ran "bin/phd start", didn't see an immediate SIGHUP in the log. Reviewers: chad Reviewed By: chad Maniphest Tasks: T12298 Differential Revision: https://secure.phabricator.com/D17550	2017-03-24 08:32:08 -07:00

1 2 3 4 5 ...

1944 commits